A Method for Extracting Important Segments from Documents Using Support Vector Machines: Toward Automatic Text Summarization
-
- Suzuki Daisuke
- Department of Systems Engineering, The University of Electro-Communications
-
- Utsumi Akira
- Department of Systems Engineering, The University of Electro-Communications
Bibliographic Information
- Other Title
-
- Support Vector Machineを用いた文書の重要文節抽出―要約文生成に向けて―
- Support Vector Machine オ モチイタ ブンショ ノ ジュウヨウ ブンセツ チュウシュツ ヨウヤクブン セイセイ ニ ムケテ
- Toward Automatic Text Summarization
- 要約文生成に向けて
Search this article
Description
In this paper we propose an extraction-based method for automatic summarization. The proposed method consists of two processes: important segment extraction and sentence compaction. The process of important segment extraction classifies each segment in a document as important or not by Support Vector Machines (SVMs). The process of sentence compaction then determines grammatically appropriate portions of a sentence for a summary according to its dependency structure and the classification result by SVMs. To test the performance of our method, we conducted an evaluation experiment using the Text Summarization Challenge (TSC-1) corpus of human-prepared summaries. The result was that our method achieved better performance than a segment-extraction-only method and the Lead method, especially for sentences only a part of which was included in human summaries. Further analysis of the experimental results suggests that a hybrid method that integrates sentence extraction with segment extraction may generate better summaries.
Journal
-
- Transactions of the Japanese Society for Artificial Intelligence
-
Transactions of the Japanese Society for Artificial Intelligence 21 330-339, 2006
The Japanese Society for Artificial Intelligence
- Tweet
Details 詳細情報について
-
- CRID
- 1390282680084262528
-
- NII Article ID
- 10022006487
-
- NII Book ID
- AA11579226
-
- ISSN
- 13468030
- 13460714
-
- NDL BIB ID
- 8686502
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- NDL Search
- Crossref
- CiNii Articles
- OpenAIRE
-
- Abstract License Flag
- Disallowed