Properties of Semantic Spaces with Respect to the Types of Semantic Relations between Words: Influence of Corpus, Construction Method, and Context Size

  • Akiyama Satoshi
    Department of System Engineering, The University of Elector-Communications
  • Utsumi Akira
    Department of System Engineering, The University of Elector-Communications

Bibliographic Information

Other Title
  • 概念間の関係に関する単語の意味空間の性質--コーパス,構築手法,文章単位による影響--
  • ガイネン カン ノ カンケイ ニ カンスル タンゴ ノ イミ クウカン ノ セイシツ コーパス コウチク シュホウ ブンショウ タンイ ニ ヨル エイキョウ

Search this article

Description

A semantic space model provides a framework of semantic representation. In this model, each word is represented by a high-dimensional vector and the degree of semantic similarity between any two words can be easily computed as the cosine of the angle formed by their vectors. Recently, a number of methods have been proposed for constructing semantic spaces, but little has been known about the properties of different semantic spaces, in particular what kinds of semantic relations can be represented by what kinds of semantic spaces. In this study, we constructed fourteen different semantic spaces using three corpora (i.e., Japanese newspaper articles, Japanese novels, and Japanese dictionary), two construction methods (i.e., term frequency (TF) and term cooccurrence (CO)) and three context-window sizes (i.e., article, paragraph, and sentence). We then examined the properties of these spaces by comparing the ability to represent three semantic relations (i.e., coordination⁄synonymy, superordination, and collocation) and their eight subrelations. As a result, we demonstrated that, regardless of construction method and window size, the coordination⁄synonymy relation was better represented by the dictionary-based semantic spaces, but the collocation relation was better represented by the newspaper- and TF-based spaces. We also found that the superordination relation was better represented by the TF-based spaces with paragraphs as a window size, and corpus difference between dictionary and newspaper did not affect the representational ability of superordination. In addition, we investigated the effects of dimensionality reduction by singular value decomposition. The overall result was that the performance in predicting word association was degraded, but the performance of typicality judgment for the coordination⁄synonymy relation was improved by dimentionality reduction.

Journal

Citations (1)*help

See more

References(22)*help

See more

Related Projects

See more

Details 詳細情報について

Report a problem

Back to top