CDMPred: a tool for predicting cancer driver missense mutations with high-quality passenger mutations.

Lihua Wang,Haiyang Sun,Zhenyu Yue,Junfeng Xia,Xiaoyan Li

doi:10.7717/peerj.17991

Abstract

Most computational methods for predicting driver mutations have been trained using positive samples, while negative samples are typically derived from statistical methods or putative samples. The representativeness of these negative samples in capturing the diversity of passenger mutations remains to be determined. To tackle these issues, we curated a balanced dataset comprising driver mutations sourced from the COSMIC database and high-quality passenger mutations obtained from the Cancer Passenger Mutation database. Subsequently, we encoded the distinctive features of these mutations. Utilizing feature correlation analysis, we developed a cancer driver missense mutation predictor called CDMPred employing feature selection through the ensemble learning technique XGBoost. The proposed CDMPred method, utilizing the top 10 features and XGBoost, achieved an area under the receiver operating characteristic curve (AUC) value of 0.83 and 0.80 on the training and independent test sets, respectively. Furthermore, CDMPred demonstrated superior performance compared to existing state-of-the-art methods for cancer-specific and general diseases, as measured by AUC and area under the precision-recall curve. Including high-quality passenger mutations in the training data proves advantageous for CDMPred's prediction performance. We anticipate that CDMPred will be a valuable tool for predicting cancer driver mutations, furthering our understanding of personalized therapy.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

CDMPred: a tool for predicting cancer driver missense mutations with high-quality passenger mutations.

Abstract

Talk to us

Similar Papers

More From: PeerJ

Lead the way for us

Journal: PeerJ	Publication Date: Jan 1, 2024
License type: cc-by

Similar Papers

Exploring preferred amino acid mutations in cancer genes: Applications to identify potential drug targets
P Anoosha ... M Michael Gromiha
Biochimica et Biophysica Acta (BBA) - Molecular Basis of Disease | VOL. 1862
P Anoosha, et. al.P Anoosha ... M Michael Gromiha
12 Nov 2015
Biochimica et Biophysica Acta (BBA) - Molecular Basis of Disease | VOL. 1862

Sequence Neighborhoods Enable Reliable Prediction of Pathogenic Mutations in Cancer Genomes.
Shayantan Banerjee ... Karthik Raman
Cancers | VOL. 13
Shayantan Banerjee, et. al.Shayantan Banerjee ... Karthik Raman
14 May 2021
Cancers | VOL. 13

Abstract 24: A genetic model of metastatic evolution: Driver and passenger mutations affect metastatic fitness
Christopher D Mcfarland ... Jacob G Scott
Cancer Research | VOL. 71
Christopher D Mcfarland, et. al.Christopher D Mcfarland ... Jacob G Scott
15 Apr 2011
Cancer Research | VOL. 71

Cancer initiation with epistatic interactions between driver and passenger mutations
Benedikt Bauer ... Arne Traulsen
Journal of Theoretical Biology | VOL. 358
Benedikt Bauer, et. al.Benedikt Bauer ... Arne Traulsen
20 May 2014
Journal of Theoretical Biology | VOL. 358

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

CDMPred: a tool for predicting cancer driver missense mutations with high-quality passenger mutations.

Abstract

Talk to us

Similar Papers

More From: PeerJ