Abstract

We have developed an integrated tool for statistical analysis of large-scale LC-MS profiles of complex protein mixtures comprising a set of procedures for data processing, selection of biomarkers used in early diagnostic and classification of patients based on their peptide mass fingerprints. Here, a novel boosting technique is proposed, which is embedded in our framework for MS data analysis. Our boosting scheme is based on Hannan-consistent game playing strategies. We analyze boosting from a game-theoretic perspective and define a new class of boosting algorithms called H-boosting methods. In the experimental part of this work we apply the new classifier together with classical and state-of-the-art algorithms to classify ovarian cancer and cystic fibrosis patients based on peptide mass spectra. The methods developed here provide automatic, general, and efficient means for processing of large scale LC-MS datasets. Good classification results suggest that our approach is able to uncover valuable information to support medical diagnosis.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call