Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): a Data-Driven Approach across Metabolic Processes

Abstract

There is an increasing need to use multivariate statistical methods for understanding biological functions, identifying the mechanisms of diseases and exploring biomarkers. In addition to classical analyses such as hierarchical cluster analysis, principal component analysis and partial least squares discriminant analysis, various multivariate strategies, including independent component analysis, non-negative matrix factorization and multivariate curve resolution, have recently been proposed. However, determining the number of components is problematic. Despite the proposal of several different methods, no satisfactory approach has yet been reported. To resolve this problem, we implemented a new idea: classifying a component as “reliable” or “unreliable” based on the reproducibility of its appearance, regardless of the number of components in the calculation. Using a clustering method for classification, we applied this idea to multivariate curve resolution-alternating least squares (MCR-ALS). Comparisons between conventional and modified methods applied to proton nuclear magnetic resonance (¹H-NMR) spectral datasets derived from known standard mixtures and biological mixtures (urine and feces of mice) revealed that more plausible results are obtained by the modified method. In particular, clusters containing little information were detected with reliability. This strategy, named “cluster-aided MCR-ALS,” will facilitate the attainment of more reliable results in metabolomics datasets.
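The idea in the abstract (run the decomposition for several component counts, pool the resolved profiles, cluster them, and call a cluster "reliable" when it reappears across runs) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `mcr_als` and `reliable_components`, the clipping-based non-negativity step, the cosine-distance average-linkage clustering, and the thresholds `min_fraction` and `max_dist` are all assumptions chosen for illustration.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster


def mcr_als(D, k, n_iter=200, seed=0):
    """Toy MCR-ALS: factor D (samples x variables) as C @ S with k components.

    Non-negativity is imposed by simple clipping (an assumption; real
    MCR-ALS codes typically use proper non-negative least squares).
    """
    rng = np.random.default_rng(seed)
    S = rng.random((k, D.shape[1]))  # random initial spectral profiles
    for _ in range(n_iter):
        C = np.clip(D @ np.linalg.pinv(S), 0, None)  # update concentrations
        S = np.clip(np.linalg.pinv(C) @ D, 0, None)  # update spectra
    return C, S


def reliable_components(D, ks=(2, 3, 4, 5), min_fraction=0.75, max_dist=0.1):
    """Pool spectral profiles from runs with different k and cluster them.

    A cluster is flagged reliable when it contains profiles from at least
    `min_fraction` of the runs (hypothetical criterion for this sketch).
    """
    profiles, run_ids = [], []
    for run, k in enumerate(ks):
        _, S = mcr_als(D, k, seed=run)
        for s in S:
            norm = np.linalg.norm(s)
            if norm > 0:  # skip components that collapsed to zero
                profiles.append(s / norm)
                run_ids.append(run)
    profiles = np.array(profiles)
    run_ids = np.array(run_ids)

    # Hierarchical clustering of the pooled profiles by cosine distance.
    Z = linkage(profiles, method="average", metric="cosine")
    labels = fcluster(Z, t=max_dist, criterion="distance")

    reliable = {}
    for lab in np.unique(labels):
        runs_present = np.unique(run_ids[labels == lab])
        reliable[int(lab)] = len(runs_present) / len(ks) >= min_fraction
    return profiles, labels, reliable


if __name__ == "__main__":
    # Synthetic mixture of three Gaussian "peaks" (made-up data).
    x = np.linspace(0, 10, 120)
    S_true = np.array([np.exp(-((x - c) ** 2) / 0.3) for c in (2.0, 5.0, 8.0)])
    C_true = np.random.default_rng(42).random((30, 3))
    D = C_true @ S_true
    _, labels, reliable = reliable_components(D)
    print("cluster labels:", labels)
    print("reliable flags:", reliable)
```

Which clusters end up flagged depends strongly on the distance threshold and on how well each individual run converges, so the thresholds here should be read as placeholders rather than recommended values.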

Published in

  • Scientific Reports

    Scientific Reports 5 (1), 15710, 2015-11-04

    Springer Science and Business Media LLC
