Compression Algorithm Of Trigram Language Models Based On Maximum Likelihood Estimation

Description

In this paper we propose an algorithm for reducing the size of back-off N-gram language models that degrades performance less than the traditional cutoff method. The algorithm is based on Maximum Likelihood (ML) estimation and produces an N-gram language model with a given number of N-gram probability parameters that minimizes the training-set perplexity. To confirm the effectiveness of the algorithm, we apply it to trigram and bigram models and carry out experiments in terms of perplexity and word error rate in a dictation system.
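
To illustrate the idea of selecting a fixed budget of N-gram parameters by their contribution to training-set likelihood rather than by raw counts, here is a minimal Python sketch. It is not the paper's algorithm: the trigram and back-off probabilities are placeholder values, the `select_trigrams` helper is hypothetical, and the greedy gain criterion ignores the renormalization of back-off weights that a full ML-based compression would need to handle.

```python
import math
from collections import Counter

# Hypothetical toy data: trigram counts from a training corpus, plus
# placeholder explicit-trigram and back-off probabilities for each entry.
trigram_counts = Counter({
    ("of", "the", "model"): 12,
    ("the", "model", "is"): 9,
    ("size", "of", "the"): 7,
    ("back", "off", "model"): 3,
    ("in", "a", "dictation"): 2,
})
trigram_prob = {t: 0.30 for t in trigram_counts}   # placeholder ML trigram estimates
backoff_prob = {t: 0.10 for t in trigram_counts}   # placeholder back-off estimates

def select_trigrams(budget):
    """Keep the `budget` trigrams whose explicit probabilities add the most
    training-set log-likelihood relative to backing off.

    Gain per trigram is count * log(p_trigram / p_backoff); keeping the
    top-gain entries greedily approximates minimizing training-set
    perplexity under a fixed parameter budget.
    """
    gain = {
        t: c * math.log(trigram_prob[t] / backoff_prob[t])
        for t, c in trigram_counts.items()
    }
    return sorted(gain, key=gain.get, reverse=True)[:budget]

if __name__ == "__main__":
    # Contrast with a count cutoff, which would drop the rarest trigrams
    # regardless of how much likelihood their explicit parameters contribute.
    print(select_trigrams(budget=3))
```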
