パラグラフ間の差異を考慮したquery-biasedな要約手法

  • 大谷 力
    早稲田大学理工学術院大学院情報生産システム研究科
  • 文 景厚
    (株)日立製作所 ソフトウェア事業部
  • 織田 泰司
    早稲田大学情報生産システム研究センター
  • 古江 敏彦
    九州電力(株) 総合研究所 環境・化学グループ
  • 内田 佳孝
    九州電力(株) 総合研究所 環境・化学グループ
  • 吉江 修
    早稲田大学理工学術院大学院情報生産システム研究科

書誌事項

タイトル別名
  • Query-biased Summarization Considering Difference of Paragraphs
  • パラグラフ カン ノ サイ オ コウリョ シタ query biased ナ ヨウヤク シュホウ

この論文をさがす

抄録

Most existing query-biased summarization methods generate the summary using extracted sentences based on similarity measure between all sentences in documents and the query. If there are plural sentences having high similarity to the query in the documents, however, these methods cannot decide from which sentence the summary should be made. This paper proposes an algorithm considering difference of paragraphs, adopting new indicator that shows the difference between one paragraph and the others. In a word space composed of all words in the target document, the algorithm determines the axis that maximizes the difference when a paragraph and the others are projected onto it. There are many combinations of a paragraph and a set of other paragraphs. For each combination, the above-mentioned axis that maximizes the difference and gives a conformity degree to the given query is calculated. With these conformities, the algorithm decides one paragraph for generating the summary. To obtain the axes, topic distinctiveness factor analysis is applied. The basic idea for making final summary is concatenating the sentences extracted from the paragraph. The resultant summary is evaluated from the points of readability, understandability and the easiness to judge whether the link works well or not.

収録刊行物

参考文献 (14)*注記

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ