変数間の関係性を考慮してクラスター数を決定するk-means法の改良

書誌事項

タイトル別名
  • An improved method using k-means to determine the optimal number of clusters, considering the relations between several variables
  • ヘンスウカン ノ カンケイセイ オ コウリョ シテ クラスタースウ オ ケッテイ スル k meansホウ ノ カイリョウ

この論文をさがす

抄録

In this article, we propose a non-hierarchical clustering method that can consider the relations between several variables and determine the optimal number of clusters. By utilizing the Mahalanobis distance instead of the Euclidean distance, which is calculated in k-means, we could consider the relations between several variables and obtain better groupings. Assuming that the data are samples from a mixture normal distribution, we could also calculate Akaike's information criterion (AIC) and the Bayesian information criterion (BIC) to determine the number of clusters. We used simulation and real data examples to confirm the usefulness of the proposed method. This method allows determination of the optimal number of clusters, considering the relations between several variables.

収録刊行物

  • 心理学研究

    心理学研究 82 (1), 32-40, 2011

    公益社団法人 日本心理学会

被引用文献 (1)*注記

もっと見る

参考文献 (13)*注記

もっと見る

関連プロジェクト

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ