Unsupervised clustering of utterances using non-parametric Bayesian methods

説明

Unsupervised clustering of utterances can be useful for the modeling of dialogue acts for dialogue applications. Previously, the Chinese restaurant process (CRP), a non-parametric Bayesian method, has been introduced and has shown promising results for the clustering of utterances in dialogue. This paper newly introduces the infinite HMM, which is also a nonparametric Bayesian method, and verifies its effectiveness. Experimental results in two dialogue domains show that the infinite HMM, which takes into account the sequence of utterances in its clustering process, significantly outperforms the CRP. Although the infinite HMM outperformed other methods, we also found that clustering complex dialogue data, such as humanhuman conversations, is still hard when compared to humanmachine dialogues. Index Terms: Unsupervised clustering, Nonparametric Bayesian methods, Chinese restaurant process, Infinite HMM

収録刊行物

詳細情報 詳細情報について

問題の指摘

ページトップへ