Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): a Data-Driven Approach across Metabolic Processes.

Hiromi Motegi,Osamu Minowa,Maki Inoue,Hideaki Toki,Yuuri Tsuboi,Tetsuo Noda,Tomoko Kagami,Ayako Saga,Jun Kikuchi

doi:10.1038/srep15710

Abstract

There is an increasing need to use multivariate statistical methods for understanding biological functions, identifying the mechanisms of diseases, and exploring biomarkers. In addition to classical analyses such as hierarchical cluster analysis, principal component analysis, and partial least squares discriminant analysis, various multivariate strategies, including independent component analysis, non-negative matrix factorization, and multivariate curve resolution, have recently been proposed. However, determining the number of components is problematic. Despite the proposal of several different methods, no satisfactory approach has yet been reported. To resolve this problem, we implemented a new idea: classifying a component as “reliable” or “unreliable” based on the reproducibility of its appearance, regardless of the number of components in the calculation. Using the clustering method for classification, we applied this idea to multivariate curve resolution-alternating least squares (MCR-ALS). Comparisons between conventional and modified methods applied to proton nuclear magnetic resonance (1H-NMR) spectral datasets derived from known standard mixtures and biological mixtures (urine and feces of mice) revealed that more plausible results are obtained by the modified method. In particular, clusters containing little information were detected with reliability. This strategy, named “cluster-aided MCR-ALS,” will facilitate the attainment of more reliable results in the metabolomics datasets.

Highlights

There is an increasing need to use multivariate statistical methods for understanding biological functions, identifying the mechanisms of diseases, and exploring biomarkers
The multivariate curve resolutionalternating least squares (MCR-ALS) calculation was repeated with the number of components being changed for each calculation
For assignment of the optimum selected cluster size with objectivity, the maximum cluster size estimated from a dataset that had been randomized to destroy all biological information was set as a threshold size

Summary

Introduction

There is an increasing need to use multivariate statistical methods for understanding biological functions, identifying the mechanisms of diseases, and exploring biomarkers. “Omics” technologies, including genomics, transcriptomics, proteomics, and metabolomics/metabonomics, have been developed to obtain a bird’s-eye view of the underlying molecular networks in a cell or organism that elaborately regulate its complex biological responses[1,2] Cross-site analytical validity studies have been conducted, showing that interconvertibility of NMR data among different institutions is one of the great advantages of NMR-based approaches[11] This property is essential for the clinical application of metabolomics-derived biomarker discovery assisted by multivariate statistical approaches to the analysis of NMR datasets[12,13]. The MCR method is useful for resolving spectroscopic data featuring broad macromolecular peaks[23] and for estimating concentrations from metabolite mixture spectra[23] For use of these methods, determination of the number of components is the most important task. This inconsistency makes it difficult to use ICA/NMF/MCR, as using the wrong number of components in the analysis decreases the reliability of the results

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Scientific Reports	Publication Date: Nov 4, 2015
Citations: 49	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): a Data-Driven Approach across Metabolic Processes.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports

Lead the way for us

Similar Papers

THz Spectroscopic Decomposition and Analysis in Mixture Inspection Using Soft Modeling Methods
Chen Xie ... Xusheng Kang
Journal of Infrared, Millimeter, and Terahertz Waves | VOL. 42
Chen Xie, et. al.Chen Xie ... Xusheng Kang
23 Nov 2020
Journal of Infrared, Millimeter, and Terahertz Waves | VOL. 42

A Reliable Muscle Synergy Extraction Method based on Multivariate Curve Resolution-Alternating Least Squares
Yehao Ma ... A Sharifi
E3S Web of Conferences | VOL. 271
Yehao Ma, et. al.Yehao Ma ... A Sharifi
01 Jan 2020
E3S Web of Conferences | VOL. 271

Analysis of longitudinal metabolomic data using multivariate curve resolution-alternating least squares and pathway analysis
Isabel Ten-Doménech ... Julia Kuligowski
Chemometrics and Intelligent Laboratory Systems | VOL. 232
Isabel Ten-Doménech, et. al.Isabel Ten-Doménech ... Julia Kuligowski
30 Nov 2022
Chemometrics and Intelligent Laboratory Systems | VOL. 232

Interval estimation in multivariate curve resolution by exploiting the principles of error propagation in linear least squares
Ahmad Mani-Varnosfaderani ... Romà Tauler
Chemometrics and Intelligent Laboratory Systems | VOL. 206
Ahmad Mani-Varnosfaderani, et. al.Ahmad Mani-Varnosfaderani ... Romà Tauler
16 Sep 2020
Chemometrics and Intelligent Laboratory Systems | VOL. 206

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Identification of Reliable Components in Multivariate Curve Resolution-Alternating Least Squares (MCR-ALS): a Data-Driven Approach across Metabolic Processes.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Scientific Reports