Hierarchical Latent Alignment for Non-Autoregressive Generation under High Compression Ratio

  • XU Wang
    School of Computer Science and Technology, Harbin Institute of Technology
  • MA Yongliang
    Beijing Langboat Technology Co., Ltd.
  • CHEN Kehai
    School of Computer Science and Technology, Harbin Institute of Technology
  • ZHOU Ming
    Beijing Langboat Technology Co., Ltd.
  • YANG Muyun
    School of Computer Science and Technology, Harbin Institute of Technology
  • ZHAO Tiejun
    School of Computer Science and Technology, Harbin Institute of Technology

Abstract

Non-autoregressive generation has attracted increasing attention due to its fast decoding speed. Latent alignment objectives such as CTC are designed to capture the monotonic alignments between predicted and output tokens, and have been used for machine translation and sentence summarization. However, our preliminary experiments revealed that CTC performs poorly on abstractive document summarization, where the compression ratio between the input and output is high. To address this issue, we conduct a theoretical analysis and propose Hierarchical Latent Alignment (HLA). The basic idea is a two-step alignment process: we first align the sentences of the input and output, and then derive token-level alignments with CTC within the aligned sentence pairs. We evaluate the effectiveness of the proposed approach on two widely used datasets, XSUM and CNNDM. The results indicate that our method exhibits remarkable scalability even at high compression ratios.
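The sketch below illustrates the two-step idea described in the abstract; it is not the authors' implementation. It assumes sentences are pre-segmented, that the sentence-level alignment from step one is already given (here a trivial monotonic pairing), and that the token-level alignment of step two is realized with PyTorch's built-in CTC loss applied within each aligned sentence pair. The function name hierarchical_ctc_loss and all shapes are hypothetical.

```python
# Minimal sketch of hierarchical latent alignment, assuming PyTorch.
# Step 1 (sentence alignment) is taken as given; step 2 applies CTC per aligned pair.
import torch
import torch.nn.functional as F

def hierarchical_ctc_loss(log_probs_per_sent, target_ids_per_sent, blank_id=0):
    """Sum CTC losses over aligned (predicted-sentence, target-sentence) pairs.

    log_probs_per_sent: list of tensors, each of shape (T_i, vocab_size) --
        log-probabilities the non-autoregressive decoder assigns to the
        positions belonging to the i-th source sentence.
    target_ids_per_sent: list of 1-D LongTensors -- token ids of the target
        sentence aligned to the i-th source sentence (output of step 1).
    """
    total = 0.0
    for log_probs, target in zip(log_probs_per_sent, target_ids_per_sent):
        T, _ = log_probs.shape
        # F.ctc_loss expects (T, N, C); treat each aligned pair as a batch of one.
        lp = log_probs.unsqueeze(1)                      # (T, 1, C)
        input_len = torch.tensor([T])
        target_len = torch.tensor([target.numel()])
        # Step 2: token-level latent alignment via CTC inside the aligned pair.
        total = total + F.ctc_loss(lp, target.unsqueeze(0), input_len, target_len,
                                   blank=blank_id, zero_infinity=True)
    return total

# Toy usage: two aligned sentence pairs, decoder lengths 6 and 8, vocab of 50.
torch.manual_seed(0)
preds = [torch.randn(6, 50).log_softmax(-1), torch.randn(8, 50).log_softmax(-1)]
targets = [torch.tensor([5, 7, 9]), torch.tensor([3, 4])]
print(hierarchical_ctc_loss(preds, targets))
```

Restricting CTC to aligned sentence pairs keeps each monotonic alignment short, which is the intuition the abstract gives for why the approach remains effective when the overall input-to-output compression ratio is high.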
