A novel mixed integer programming for multi-biomarker panel identification by distinguishing malignant from benign colorectal tumors

Meng Zou, Peng-Jun Zhang, Xin-Yu Wen, Luonan Chen, Ya-Ping Tian, Yong Wang

doi:10.1016/j.ymeth.2015.05.011

Abstract

Multi-biomarker panels can capture nonlinear synergy among biomarkers, and they are important aids in the early diagnosis of complex diseases and ultimately in the battle against them. However, identifying these panels from case and control data is challenging. For example, exhaustive search is computationally infeasible when the data dimension is high.

Here, we propose a novel method, MILP_k, to identify a serum-based multi-biomarker panel that distinguishes colorectal cancers (CRC) from benign colorectal tumors. Specifically, the multi-biomarker panel detection problem is modeled as a mixed integer program that maximizes the classification accuracy. We then measured serum profiling data for 101 CRC patients and 95 benign patients. The 61 biomarkers were analyzed individually, and their combinations were then analyzed by our method.

We discovered 4 biomarkers as the optimal small multi-biomarker panel, including the known CRC biomarkers CEA and IL-10 as well as the novel biomarkers IMA and NSE. This panel achieves a leave-one-out cross-validation (LOOCV) accuracy of 0.7857 with a nearest centroid classifier. An independent test of this panel by support vector machine (SVM) with threefold cross-validation yields an AUC of 0.8438. This improves the predictive accuracy by 20% over the single best biomarker. Extending this 4-biomarker panel to a larger 13-biomarker panel improves the LOOCV accuracy to 0.8673, with an independent AUC of 0.8437. Comparison with the exhaustive search method shows that our method reduces the search time by 1000-fold. Experiments on the early cancer stage samples reveal two panels of biomarkers and show promising accuracy.

The proposed method allows us to select, for a given number of biomarkers, the subset with the best accuracy in distinguishing case and control samples. Both the receiver operating characteristic curve and the precision-recall curve show our method's consistent performance gain in accuracy.
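As an illustration of the evaluation criterion named above, the following is a minimal sketch of LOOCV accuracy for a nearest centroid classifier restricted to a chosen panel. It is not the authors' MATLAB implementation: the function names (`centroid`, `loocv_nearest_centroid`), the Euclidean distance, the tie-breaking rule, and the toy data are all assumptions made for the example.

```python
from math import dist


def centroid(rows):
    """Component-wise mean of a non-empty list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def loocv_nearest_centroid(X, y, panel):
    """LOOCV accuracy of a nearest centroid classifier using only `panel`.

    X     : list of samples, each a list of biomarker values
    y     : list of labels (1 = case, 0 = control)
    panel : indices of the biomarkers to keep (hypothetical argument name)
    """
    # Restrict every sample to the selected biomarkers.
    Z = [[row[j] for j in panel] for row in X]
    correct = 0
    for i in range(len(Z)):
        # Hold out sample i; build class centroids from the remaining samples.
        train = [(z, lab) for k, (z, lab) in enumerate(zip(Z, y)) if k != i]
        c_case = centroid([z for z, lab in train if lab == 1])
        c_ctrl = centroid([z for z, lab in train if lab == 0])
        # Assign the held-out sample to the closer centroid (ties go to control).
        pred = 1 if dist(Z[i], c_case) < dist(Z[i], c_ctrl) else 0
        correct += pred == y[i]
    return correct / len(Z)


# Toy data (invented): biomarker 0 separates the classes, biomarker 1 does not.
X_toy = [[1.0, 5.0], [1.2, 1.0], [0.9, 4.0],
         [3.0, 5.0], [3.1, 1.0], [2.9, 4.0]]
y_toy = [0, 0, 0, 1, 1, 1]

acc_informative = loocv_nearest_centroid(X_toy, y_toy, panel=[0])
acc_noise = loocv_nearest_centroid(X_toy, y_toy, panel=[1])
print(acc_informative, acc_noise)
```

A panel search, whether exhaustive or optimization-based, would score candidate panels with a criterion of this kind; the sketch only shows the scoring step, not the MILP itself.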
Our method also shows its advantage in capturing synergy among the selected biomarkers. The multi-biomarker panel far outperforms the simple combination of the best single features. Close investigation of the panel illustrates that our method can remove redundancy and reveal complementary biomarker combinations. In addition, our method is efficient and can select multi-biomarker panels with more than 5 biomarkers, for which exhaustive methods fail.

In conclusion, we propose a promising model to improve clinical data interpretability and to serve as a useful tool for other complex disease studies. Our small multi-biomarker panel, CEA, IL-10, IMA, and NSE, may provide insights into the disease status of colorectal diseases. The implementation of our method in MATLAB is available via the website: http://doc.aporc.org/wiki/MILP_k.
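To give a sense of why exhaustive search becomes impractical as the panel grows, the sketch below counts the candidate panels of size k that can be drawn from the 61 biomarkers reported above. The helper name `n_panels` is hypothetical, and the count says nothing about the actual running times in the study; it only shows how quickly the search space grows.

```python
from math import comb

N_BIOMARKERS = 61  # number of biomarkers reported in the abstract


def n_panels(k, n=N_BIOMARKERS):
    """Number of distinct k-biomarker panels that can be chosen from n biomarkers."""
    return comb(n, k)


# Panel sizes mentioned in the abstract (4, 5, 13) plus smaller ones for scale.
for k in (1, 2, 3, 4, 5, 13):
    print(f"k={k:2d}: {n_panels(k):,} candidate panels")
```

Each candidate would also need its own cross-validation run under an exhaustive scheme, so the total cost grows faster still than the counts printed here.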

Journal: Methods
Publication Date: May 15, 2015
Citations: 23
