A Method of Extracting Related Words Using Standardized Mutual Information

Masayuki Takeda, Tomohiko Sugimachi, Fumihiro Matsuo, Akira Ishino

doi:10.1007/978-3-540-39644-4_49

Techniques for the automatic extraction of related words are of great importance in many applications, such as query expansion and automatic thesaurus construction. In this paper, we propose a method of extracting related words based on statistical information about the co-occurrences of words in huge corpora. Mutual information is one such statistical measure and has been used mainly in natural language processing applications. A drawback, however, is that mutual information depends mainly on the frequencies of words. To overcome this difficulty, we propose as a new measure a normalized deviation of mutual information. We also reveal a correspondence between word ambiguity and related words using word relation graphs constructed with this measure.
