Exploring Words with Semantic Correlations from Chinese Wikipedia

DOI Open Access

Description

In this paper, we study semantic correlation between Chinese words using Wikipedia documents. We generate a corpus of about 50,000 structured documents from Wikipedia pages. Then, taking hyperlinks, text overlap, and word frequency into account, we extract about 300,000 word pairs with semantic correlations from these documents. We roughly measure the degree of semantic correlation and find groups of tightly correlated words by self-clustering.
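The abstract does not give the scoring formula or the clustering procedure, so the following is only a minimal sketch of how the three signals might be combined, not the authors' method. Everything in it is an assumption made for illustration: the document layout (`title`, `links`, `words`), the function names `correlation_scores` and `tight_groups`, the weights, the `min_df` frequency cutoff, the use of Jaccard overlap between the sets of documents containing each word, and the use of connected components over high-scoring pairs as a stand-in for the paper's self-clustering.

```python
from collections import defaultdict
from itertools import combinations


def correlation_scores(docs, link_weight=1.0, overlap_weight=1.0, min_df=1):
    """Score word pairs from hyperlink and text-overlap evidence.

    docs: list of dicts with 'title' (str), 'links' (set of str) and
    'words' (list of str). This layout is hypothetical, not the paper's.
    """
    # Document frequency: which documents does each word appear in?
    doc_sets = defaultdict(set)
    for i, doc in enumerate(docs):
        for word in set(doc["words"]) | {doc["title"]}:
            doc_sets[word].add(i)

    scores = defaultdict(float)

    # Hyperlink evidence: a page title and each title it links to.
    for doc in docs:
        for target in doc["links"]:
            if target != doc["title"]:
                scores[tuple(sorted((doc["title"], target)))] += link_weight

    # Text-overlap evidence: Jaccard overlap of the document sets of
    # words that co-occur in at least one document. Words seen in fewer
    # than min_df documents are ignored (the assumed frequency filter).
    cooccurring = set()
    for doc in docs:
        kept = sorted(w for w in set(doc["words"]) if len(doc_sets[w]) >= min_df)
        cooccurring.update(combinations(kept, 2))
    for a, b in cooccurring:
        shared = len(doc_sets[a] & doc_sets[b])
        total = len(doc_sets[a] | doc_sets[b])
        scores[(a, b)] += overlap_weight * shared / total

    return dict(scores)


def tight_groups(scores, threshold):
    """Group words linked by pairs scoring at least `threshold`.

    Uses union-find to compute connected components; this is a simple
    substitute for the self-clustering step, which the abstract does
    not specify.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path compression
            x = parent[x]
        return x

    for (a, b), score in scores.items():
        if score >= threshold:
            parent[find(a)] = find(b)

    groups = defaultdict(set)
    for word in list(parent):
        groups[find(word)].add(word)
    return [group for group in groups.values() if len(group) > 1]


# Toy corpus, invented for illustration only.
docs = [
    {"title": "北京", "links": {"中国"}, "words": ["北京", "中国", "首都"]},
    {"title": "中国", "links": {"北京"}, "words": ["中国", "北京", "首都"]},
    {"title": "苹果", "links": set(), "words": ["苹果", "水果"]},
]
scores = correlation_scores(docs)
groups = tight_groups(scores, threshold=2.0)
```

On this toy corpus only the pair 北京 / 中国 has both mutual hyperlinks and full text overlap, so it is the only pair that survives a threshold of 2.0. Raising or lowering the threshold trades group tightness against coverage.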
