A discriminative HMM/N-gram-based retrieval approach for mandarin spoken documents

Berlin Chen, Hsin-Min Wang, Lin-Shan Lee

doi:10.1145/1034780.1034784

【Updated on May 12, 2025】 Integration of CiNii Dissertations and CiNii Books into CiNii Research
Trial version of CiNii Research Knowledge Graph Search feature is available on CiNii Labs
【Updated on June 30, 2025】Suspension and deletion of data provided by Nikkei BP
Regarding the recording of “Research Data” and “Evidence Data”

A discriminative HMM/N-gram-based retrieval approach for mandarin spoken documents

DOI Web Site 1 Citations

Berlin Chen

National Taiwan Normal University, Taipei, Taiwan
Hsin-Min Wang

Academia Sinica, Taipei, Taiwan
Lin-Shan Lee

National Taiwan University, Taipei, Taiwan

Search this article

CiNii Books

Description

<jats:p>In recent years, statistical modeling approaches have steadily gained in popularity in the field of information retrieval. This article presents an HMM/N-gram-based retrieval approach for Mandarin spoken documents. The underlying characteristics and the various structures of this approach were extensively investigated and analyzed. The retrieval capabilities were verified by tests with word- and syllable-level indexing features and comparisons to the conventional vector-space model approach. To further improve the discrimination capabilities of the HMMs, both the expectation-maximization (EM) and minimum classification error (MCE) training algorithms were introduced in training. Fusion of information via indexing word- and syllable-level features was also investigated. The spoken document retrieval experiments were performed on the Topic Detection and Tracking Corpora (TDT-2 and TDT-3). Very encouraging retrieval performance was obtained.</jats:p>

Journal

ACM Transactions on Asian Language Information Processing

ACM Transactions on Asian Language Information Processing 3 (2), 128-145, 2004-06

Association for Computing Machinery (ACM)

Citations (1)*help

Keywords

General Computer Science

Details 詳細情報について

CRID

1361699995440284544
DOI

10.1145/1034780.1034784
ISSN

15583430

15300226
Web Site

https://dl.acm.org/doi/pdf/10.1145/1034780.1034784
Data Source
- Crossref

A discriminative HMM/N-gram-based retrieval approach for mandarin spoken documents

Search this article

Description

Journal

Citations (1)*help

Keywords

Details 詳細情報について

Export

Report a problem