- 【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
- Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
- 【Updated on June 30, 2025】Suspension and deletion of data provided by Nikkei BP
- Regarding the recording of “Research Data” and “Evidence Data”
Word Translation by Combining an Example-Based Method and Machine Learning Models.
-
- UCHIMOTO KIYOTAKA
- Communications Research Laboratory
-
- SEKINE SATOSHI
- Computer Science Department, New York University
-
- MURATA MASAKI
- Communications Research Laboratory
-
- ISAHARA HITOSHI
- Communications Research Laboratory
Bibliographic Information
- Other Title
-
- 用例に基づく手法と機械学習モデルの組み合せによる訳語選択
- ヨウレイ ニ モトヅク シュホウ ト キカイ ガクシュウ モデル ノ クミアワセ ニ ヨル ヤクゴ センタク
Search this article
Description
We describe the method for word selection in machine translation. Given an input sentence and a target word in the sentence, our system first estimates the similarity between the input sentence and parallel example sets called “Translation Memory.” It then selects an appropriate translation of the target word by using the example set with the highest similarity. The similarity is calculated using an example-based method and a machine learning model, which assesses the similarity based on the similarity of a string, words to the left and right of the target word in the input sentence, frequencies of content words of the input sentence and those of their translation candidates in bilingual and monolingual corpora in English and Japanese. Given an input sentence and a target word in the sentence, an example-based method is applied to them in the first step. Then, if an appropriate example set is not found, a machine learning model is applied to them. The most appropriate machine learning model is selected for each target word from several machine learning models by a certain method such as cross-validation on the training data. In this paper, we show the advantage of our method and also show that what kinds of information contributed to improving the accuracy based on the results of the second contest on word sense disambiguation, SENSEVAL-2, which was held in Spring, 2001.
Journal
-
- Journal of Natural Language Processing
-
Journal of Natural Language Processing 10 (3), 87-114, 2003
The Association for Natural Language Processing
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1390282679452988800
-
- NII Article ID
- 130004101374
- 80015972154
-
- NII Book ID
- AN10472659
-
- ISSN
- 21858314
- 13407619
-
- NDL BIB ID
- 6560238
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- NDL Search
- Crossref
- CiNii Articles
-
- Abstract License Flag
- Disallowed