Consistent Data-to-Text Generation with Topic Sequences
-
- MURAKAMI Soichiro
- Tokyo Institute of Technology NTT DOCOMO, INC. National Institute of Advanced Industrial Science and Technology
-
- TANAKA Sora
- Tokyo Institute of Technology Gurunavi, Inc.
-
- HANGYO Masatsugu
- Weathernews Inc.
-
- KAMIGAITO Hidetaka
- Tokyo Institute of Technology
-
- TAKAMURA Hiroya
- Tokyo Institute of Technology National Institute of Advanced Industrial Science and Technology
-
- OKUMURA Manabu
- Tokyo Institute of Technology
Bibliographic Information
- Other Title
-
- Data-to-Textモデルにおけるトピック系列を用いた一貫性の制御
Abstract
<p>Data-to-Text is the task of generating text that describes the contents from various types of data, such as weather forecast maps and time-series stock data. With the recent advances in neural networks, the data-to-text models have been making remarkable progress in terms of the ability to capture the characteristics of such complicated data and generate text that accurately describes their contents. However, concerning the generation of a document-scale text that includes multiple sentences, the data-to-text model often generates descriptions that mention duplicated contents and lacks the consistency of their topics due to the over-generation problem. In this paper, we focus on generating descriptions having not only accuracy but also consistency regarding their contents to tackle the problem mentioned above, which often causes in the data-to-text task. We propose a method to generate consistent text by predicting a sequence of topics from data and assigning it to the generation model to control the topics and their order. In the experiment, we show that the data-to-text model produces text containing consistent topics by specifying a sequence of topics to be mentioned and that it helps to relieve the over-generation problem.</p>
Journal
-
- Proceedings of the Annual Conference of JSAI
-
Proceedings of the Annual Conference of JSAI JSAI2020 (0), 1D5GS905-1D5GS905, 2020
The Japanese Society for Artificial Intelligence
- Tweet
Details 詳細情報について
-
- CRID
- 1390285300165940096
-
- NII Article ID
- 130007856682
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- CiNii Articles
-
- Abstract License Flag
- Disallowed