Document retrieval based on word's cooccurrences, the algorithum and its application.

ISHIOKA Tsunenori, KAMEDA Masayuki

doi:10.5023/jappstat.28.107

【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
【Updated on June 30, 2025】Suspension and deletion of data provided by Nikkei BP
Regarding the recording of “Research Data” and “Evidence Data”

Document retrieval based on word's cooccurrences, the algorithum and its application.

DOI Web Site 3 Citations 32 References

ISHIOKA Tsunenori

大学入試センター研究開発部情報処理研究部門
KAMEDA Masayuki

株式会社リコーソフトウェア研究所

Bibliographic Information

Other Title

単語の共起に基づく関連文書検索，算法と検索事例
タンゴノキョウキニモトヅクカンレンブンショケンサクサンポウトケンサクジレイ

Search this article

Description

異なった文書に同時に現われる単語に着目することにより,潜在的な意味的検索をおこなうDeerwester(1990)のLatent Semantic Analysisを日本語の比較的大規模な文書集合に対して適用した.その中で,大型疎行列における特異値分解アルゴリズムの比較検討を行ない,日本語文書検索に適した方法を見つけた.これを実際の新聞記事で試し,文書検索,および関連語表示において有効であることの見通しを得た.また実装する上での工夫として,関連文書検索においては,文書の大きさによる基準化が必要なことがわかった.さらに,重複を許す単語のクラスタリングを試みた.

Journal

Ouyou toukeigaku

Ouyou toukeigaku 28 (2), 107-121, 1999

Japanese Society of Applied Statistics

Citations (3)*help

References(32)*help

Keywords

Details 詳細情報について

CRID

1390282679418483328
NII Article ID

10009669119

10014848818
NII Book ID

AN00330942
DOI

10.5023/jappstat.28.107
ISSN

18838081

02850370
NDL BIB ID

4894899
Web Site

http://id.ndl.go.jp/bib/4894899

https://ndlsearch.ndl.go.jp/books/R000000004-I4894899
Text Lang

ja
Data Source
- JaLC
- NDL Search
- Crossref
- CiNii Articles
Abstract License Flag
Disallowed

Document retrieval based on word's cooccurrences, the algorithum and its application.

Bibliographic Information

Search this article

Description

Journal

Citations (3)*help

References(32)*help

Keywords

Details 詳細情報について

Export

Report a problem