A Study of Reward Functions Suitable for Reinforcement Learning in Machine Translation
-
- NAKATANI Yuki
- Ehime University
-
- KAJIWARA Tomoyuki
- Ehime University
-
- NINOMIYA Takashi
- Ehime University
Bibliographic Information
- Other Title
-
- 機械翻訳の強化学習に適した報酬関数の調査
Description
<p>In text generation tasks such as machine translation, models are generally trained using cross-entropy loss.However, mismatches between the loss function and the evaluation metric are often problematic.It is known that this problem can be addressed by direct optimization to the evaluation metric with reinforcement learning.In machine translation, previous studies have used BLEU to calculate rewards for reinforcement learning, but BLEU is not well correlated with human evaluation.In this study, we investigate the impact on machine translation quality through reinforcement learning based on evaluation metrics that are more highly correlated with human evaluation.Experimental results show that reinforcement learning based on BERT trained on the STS task can improve various evaluation metrics.</p>
Journal
-
- Proceedings of the Annual Conference of JSAI
-
Proceedings of the Annual Conference of JSAI JSAI2022 (0), 1P4GS604-1P4GS604, 2022
The Japanese Society for Artificial Intelligence
- Tweet
Details 詳細情報について
-
- CRID
- 1390574181068834688
-
- ISSN
- 27587347
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
-
- Abstract License Flag
- Disallowed