-
- WANG Xiaoxuan
- School of Computer Science, Northwestern Polytechnical University
-
- XIE Lei
- School of Computer Science, Northwestern Polytechnical University
-
- LU Mimi
- School of Computer Science, Northwestern Polytechnical University
-
- MA Bin
- Institute for Infocomm Research
-
- CHNG Eng Siong
- School of Computer Engineering, Nanyang Technological University
-
- LI Haizhou
- Institute for Infocomm Research
書誌事項
- タイトル別名
-
- Broadcast News Story Segmentation Using Conditional Random Fields and Multimodal Features
- セレクト オヨビ マージ チョウテンスウ ノ サイショウカ ニ ヨル パイプラインカ イソンセイ グラフ ノ カンタンカ
この論文をさがす
説明
In this paper, we propose integration of multimodal features using conditional random fields (CRFs) for the segmentation of broadcast news stories. We study story boundary cues from lexical, audio and video modalities, where lexical features consist of lexical similarity, chain strength and overall cohesiveness; acoustic features involve pause duration, pitch, speaker change and audio event type; and visual features contain shot boundaries, anchor faces and news title captions. These features are extracted in a sequence of boundary candidate positions in the broadcast news. A linear-chain CRF is used to detect each candidate as boundary/non-boundary tags based on the multimodal features. Important interlabel relations and contextual feature information are effectively captured by the sequential learning framework of CRFs. Story segmentation experiments show that the CRF approach outperforms other popular classifiers, including decision trees (DTs), Bayesian networks (BNs), naive Bayesian classifiers (NBs), multilayer perception (MLP), support vector machines (SVMs) and maximum entropy (ME) classifiers.
収録刊行物
-
- IEICE Transactions on Information and Systems
-
IEICE Transactions on Information and Systems E95.D (5), 1206-1215, 2012
一般社団法人 電子情報通信学会
- Tweet
詳細情報 詳細情報について
-
- CRID
- 1390282679354417024
-
- NII論文ID
- 10030942437
- 110009444737
-
- NII書誌ID
- AA12099634
-
- ISSN
- 18804535
- 17451361
- 09168532
-
- BIBCODE
- 2012IEITI..95.1206W
-
- NDL書誌ID
- 023751202
-
- 本文言語コード
- en
-
- データソース種別
-
- JaLC
- NDL
- Crossref
- CiNii Articles
- KAKEN
-
- 抄録ライセンスフラグ
- 使用不可