Disambiguating Word Senses in Korean-Japanese Machine Translation by Using Semi-Automatically Constructed Ontology

  • KANG Sin-Jae
    Div. of Computer and Communication Engineering, Daegu Univ.
  • CHUNG You-Jin
    Div. of Electrical and Computer Engineering, Pohang Univ. of Science and Technology (POSTECH)
  • LEE Jong-Hyeok
    Div. of Electrical and Computer Engineering, Pohang Univ. of Science and Technology (POSTECH)

この論文をさがす

説明

This paper presents a method for disambiguating word senses in Korean-Japanese machine translation by using a language independent ontology. This ontology stores semantic constraints between concepts and other world knowledge, and enables a natural language processing system to resolve semantic ambiguities by making inferences with the concept network of the ontology. In order to acquire a language-independent and reasonably practical ontology in a limited time and with less manpower, we extend the existing Kadokawa thesaurus by inserting additional semantic relations into its hierarchy, which are classified as case relations and other semantic relations. The former can be obtained by converting valency information and case frames from previously-built electronic dictionaries used in machine translation. The latter can be acquired from concept co-occurrence information, which is extracted automatically from a corpus. In practical machine translation systems, our word sense disambiguation method achieved an improvement of average precision by 6.0% for Japanese analysis and by 9.2% for Korean analysis over the method without using an ontology.

収録刊行物

参考文献 (24)*注記

もっと見る

詳細情報 詳細情報について

  • CRID
    1573387452051228928
  • NII論文ID
    110006376603
  • NII書誌ID
    AA10826272
  • ISSN
    09168532
  • 本文言語コード
    en
  • データソース種別
    • CiNii Articles

問題の指摘

ページトップへ