Using acoustic dissimilarity measures based on state-level distance vector representation for improved spoken term detection

Naoki Yamamoto, Atsuhiko Kai

doi:10.1109/apsipa.2013.6694151

Using acoustic dissimilarity measures based on state-level distance vector representation for improved spoken term detection

Description

This paper proposes a simple approach to subword-based spoken term detection (STD) which uses improved acoustic dissimilarity measures based on a distance-vector representation at the state-level. Our approach assumes that both the query term and spoken documents are represented by subword units and then converted to the sequence of HMM states. A set of all distributions in subword-based HMMs is used for generating distance-vector representation of each state of all subword units. The element of a distance-vector corresponds to the distance between distributions of two different states, and thus a vector represents a structural feature at the state-level. The experimental result showed that the proposed method significantly outperforms the baseline method, which employs a conventional acoustic dissimilarity measure based on subword unit, with very little increase in the required search time.

Journal

2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference

2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 1-4, 2013-10

IEEE

References(14)*help

Related Projects

Details 詳細情報について

CRID

1360848660084400384
DOI

10.1109/apsipa.2013.6694151
Web Site

http://xplorestaging.ieee.org/ielx7/6682637/6694103/06694151.pdf?arnumber=6694151
Article Type

journal article
Data Source
- Crossref
- KAKEN
- OpenAIRE

Using acoustic dissimilarity measures based on state-level distance vector representation for improved spoken term detection

Description

Journal

References(14)*help

Related Projects

Details 詳細情報について

Export

Report a problem