Unsupervised Quality Estimation via Multilingual Denoising Autoencoder

Nishihara Tetsuro, Iwamoto Yuji, Yoshinaka Masato, Kajiwara Tomoyuki, Arase Yuki, Ninomiya Takashi

doi:10.5715/jnlp.29.669

Bibliographic Information

Other Title

多言語雑音除去自己符号化器による教師なし品質推定

Description

<p>Supervised quality estimation methods require a corpus that manually annotates qualities of translation outputs. To avoid such costly annotation process, previous studies have proposed unsupervised quality estimation methods based on machine translation models trained on a large-scale parallel corpora. However, these methods are not applicable to low-resource or zero-resource language pairs. This study addresses this problem by utilising a pre-trained multilingual denoising autoencoder. Specifically, the proposed method constructs a machine translation model by fine-tuning the multilingual denoising autoencoder with parallel corpora. It then estimates the translation quality as a forced-decoding probability of a translation output given its source sentence. The pre-trained denoising autoencoder captures linguistic characteristics across languages, which allows our method to evaluate translation quality of low-resource and zero-resource language pairs. Evaluation results on the WMT20 quality estimation task confirm that the proposed method achieves the best unsupervised quality estimation performance for five language pairs under the black box settings. Detailed analysis shows that the proposed method also performs well on under the zero-shot setting. </p>

Journal

Journal of Natural Language Processing

Journal of Natural Language Processing 29 (2), 669-687, 2022

The Association for Natural Language Processing

Details 詳細情報について

CRID: 1390010931108743424

DOI: 10.5715/jnlp.29.669

ISSN: 21858314; 13407619

Web Site: https://www.jstage.jst.go.jp/article/jnlp/29/2/29_669/_pdf

Text Lang: ja

Article Type: journal article

Data Source

JaLC
Crossref
KAKEN
OpenAIRE

Abstract License Flag: Disallowed

Export

Report a problem

Unsupervised Quality Estimation via Multilingual Denoising Autoencoder

Bibliographic Information

Search this article

Description

Journal

References(18)*help

Related Projects

Keywords

Details 詳細情報について

Export

Report a problem