Multinomial PCA for extracting major latent topics from document streams

M. Kimura, K. Saito, N. Ueda

doi:10.1109/ijcnn.2005.1555836

Multinomial PCA for extracting major latent topics from document streams

説明

We propose a new unsupervised learning method called multinomial PCA (MuPCA) for efficiently extracting the major latent topics from a document stream based on the "bag-of-words" (BOW) representation of a document. Unlike PCA, MuPCA follows a suitable probabilistic generative model for the document stream represented as time-series of word-frequency vectors. Using real data of document streams on the Web, we experimentally demonstrate the effectiveness of the proposed method.

収録刊行物

Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005.

Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005. 2 238-243, 2006-01-05

IEEE

被引用文献 (1)*注記

CRID

1360866225177746816
DOI

10.1109/ijcnn.2005.1555836
Web Site

http://xplorestaging.ieee.org/ielx5/10421/33089/01555836.pdf?arnumber=1555836
データソース種別
- Crossref
- OpenAIRE

書き出し

問題の指摘

ページトップへ

Multinomial PCA for extracting major latent topics from document streams

説明

収録刊行物

被引用文献 (1)*注記

詳細情報 詳細情報について

書き出し

問題の指摘

詳細情報詳細情報について