Comparison between Pearson Correlation Coefficient and Mutual Information as a Similarity Measure of Gene Expression Profiles

Bibliographic Information

Other Title
  • 遺伝子発現プロファイル類似度としてのピアソン相関係数と相互情報量の比較
  • イデンシ ハツゲン プロファイル ルイジド ト シテ ノ ピアソン ソウカン ケイスウ ト ソウゴ ジョウホウリョウ ノ ヒカク

Search this article

Abstract

Definition of similarity is required for clustering co-expressed genes or estimating gene regulatory network from gene expression data. Pearson correlation coefficient and mutual information are the popular measures to evaluate similarity between gene expression profiles. To investigate which measure is appropriate for evaluating similarity between gene expression profiles, we have compared these two measures using Gene ontology annotation similarity. Genes that have similar Gene ontology annotations can be interpreted that they have commonality in biological processes or molecular functions. The results showed that the better similarity measure is different depending on the purpose of the analysis or from which organism the data derived. In the case of evaluating similarities among more than three genes, mutual information was a better similarity measure for the data derived from multicellular organisms, though Pearson correlation coefficient was a better similarity measure for the data derived from unicellular organisms. In the case of finding genes whose transcripts have similar functions or genes that participate to similar processes, Pearson correlation coefficient was always a better measure.

Journal

References(45)*help

See more

Details 詳細情報について

Report a problem

Back to top