Abstract

The discovery of biomarker is a critical and essential step in metabolomics research. With the increasing complexity of metabolomics data generated by high resolution instruments, it is always an urgent need for chemometricians or statisticians to develop a method to efficiently reveal informative metabolites (variables). Based on the framework of model population analysis, a strategy coupled with partial least squares discriminant analysis (PLS-DA), called revealing informative metabolites iteratively (RIMI), was proposed in this study. For the sake of considering the synergetic effect of multiple variables, a vast population of random variable combinations are generated. It is worth pointing out that only the variable combinations with higher model accuracy are used to make paired models in order to statistically assess the importance of each variable in accordance with its beneficial contribution to classification model performance. Four types of variables which include strongly informative, weakly informative, noise and interfering variables, are then identified based on the difference and its significance of the area under the receiver operating characteristic curve (AUROC) values of exclusion and inclusion of each variable. With this definition, unbeneficial variables, including noise and interfering variables, were eliminated iteratively in a mild way. Strongly and weakly informative variables regarded as beneficial variables, are retained, and their P values of t-test are used to reveal the best variable subset. Due to the advantage in exploring useful information from a vast number of variable combinations with good performance, when applied to two metabolomics datasets, RIMI has greatly improved the accuracy value of classification model compared to other methods as the results show. It is indicated that RIMI has efficiently revealed informative metabolites and is regarded as a good alternative for biomarker discovery.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.