A Japanese Sentence Compaction Algorithm Based on Phrase Significance and Inter-Phrase Dependency
-
- OGURO REI
- Graduate School of Electro-Communications, The Umiversity of Electro-Communicatioms
-
- OZEKI KAZUHIKO
- Graduate School of Electro-Communications, The Umiversity of Electro-Communicatioms
-
- ZHANG YUJIE
- ATR Spoken Language Tanslation Research Laboratories
-
- TAKAGI KAZUYUKI
- Graduate School of Electro-Communications, The Umiversity of Electro-Communicatioms
Bibliographic Information
- Other Title
-
- 文節重要度と係り受け整合度に基づく日本語文簡約アルゴリズム
- ブンセツ ジュウヨウド ト カカリウケ セイゴウド ニ モトヅク ニホンゴブン カンヤク アルゴリズム
Search this article
Abstract
Conventional methods for text summarization are mostly based on the idea of selecting important sentences from a set of given sentences such as a paragraph or a whole text. Those methods have a merit that each selected sentence remains unchanged and is thus correct. However, it is sometimes necessary to shorten each sentence, when a higher compaction rate is required, or when a paragraph-by-paragraph summarization is not adequate. In such sentence compaction, it is important that a shortened sentence is natural as a Japanese sentence. In this paper, the sentence compaction problem is formulated as “a problem of selecting a subsequence of phrases from a given sentence that maximizes the sum of phrase significance scores and inter-phrase dependency scores.” Then, an efficient algorithm to solve this problem is proposed. Since this method takes inter-phrase dependency into account, a shortened sentence is expected to be grammatically correct and natural. This paper is focused on the derivation, computational complexity, and implementation issues of the algorithm, and will not discuss the matter of how to define the phrase significance score and the inter-phrase dependency score, though it will be a crucially important matter in practical applications.
Journal
-
- Journal of Natural Language Processing
-
Journal of Natural Language Processing 8 (3), 3-18, 2001
The Association for Natural Language Processing
- Tweet
Details 詳細情報について
-
- CRID
- 1390282679453458176
-
- NII Article ID
- 10021991423
- 10011861675
-
- NII Book ID
- AN10472659
-
- ISSN
- 21858314
- 13407619
-
- NDL BIB ID
- 5840969
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- NDL
- Crossref
- CiNii Articles
-
- Abstract License Flag
- Disallowed