AdaBoost/LogitBoost によるWhy テキストセグメント判定と回答抽出の自動化

田中, 克幸, 滝口, 哲也, 有木, 康雄

書誌事項

タイトル別名

AdaBoost/LogitBoost ニヨル Why テキストセグメントハンテイトカイトウチュウシュツノジドウカ
Automatic Why Text Segment Classification and Answer Extraction by Machine Learning
自然言語

この論文をさがす

抄録

従来の質問応答システムは，What，Where，Who を扱った質問に対して，事実に関係する回答を行う研究，つまりFactoid 型質問応答システムが主流である．“～はなぜ？” のように原因を求めるWhy 型や，“どのように～できる？” のような方法を探究するHow 型の質問に対応した研究例は多いとはいえない．そこで，本研究では，インターネット上にあるテキスト文書中のテキストセグメントのWhy 判定と，セグメント内の事実文と理由文の位置関係によりCase に分けた回答文の特定を，機械学習によって自動的に行う方法を提案する．Why 判定ではF 値約80%で判別可能となった．回答部分の抽出でも各クラスのF 値を向上させることができた．

Typical question-answering systems deal with factoid types, such as ‘what’, ‘where’, and ‘who’. These types of QA systems are concerned mainly with finding facts from corpus, and are thus unable to answer questions asking for reasons for some events or things. This paper presents the algorithm to find ‘Why-based’ answers from the internet. The main focus of this paper is to classify Why-based text segments and extractWhy-based answers from the segment with Cases, which are differentiated automatically by the position of the fact and reason sentence within a segment, using machine learning. The experiment showed improvement on differentiating Why-based segments from text. Also, this method enabled enhancement of F-measurement of answer extraction.

収録刊行物

情報処理学会論文誌

情報処理学会論文誌 49 (6), 2234-2242, 2008-06-15

東京 : 情報処理学会

詳細情報詳細情報について

CRID: 1050845762811412736

NII論文ID: 40019584657; 10029640327

NII書誌ID: AN00116647

ISSN: 18827764; 18827837; 03875806

NDL書誌ID: 024276504

Web Site: http://id.nii.ac.jp/1001/00009574/; https://ndlsearch.ndl.go.jp/books/R000000004-I024276504

本文言語コード: ja

資料種別: journal article

データソース種別

IRDB
NDL
CiNii Articles
KAKEN

AdaBoost/LogitBoost によるWhy テキストセグメント判定と回答抽出の自動化

書誌事項

この論文をさがす

抄録

収録刊行物

被引用文献 (1)*注記

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

AdaBoost/LogitBoost によるWhy テキストセグメント判定と回答抽出の自動化

書誌事項

この論文をさがす

抄録

収録刊行物

被引用文献 (1)*注記

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について