Abstract

With the development of next-generation sequencing technology, massive genomic data has been generated, primarily encouraging research on cancer driver genes. Many bioinformatics methods were proposed to identify driver genes. However, the results of driver gene identification a mong these methods show considerable differences. It is still challenging to obtain a comprehensive catalog of cancer drivers. Although current methods have greatly promoted the development of driver genes, few methods can integrate the identification results of existing methods. To solve such problems in cancer driver genes research, we proposed a cascade forest model to discover cancer driver genes(CFC) that can integrate multi-omics data and annotation scores from different cancer driver gene identification algorithms. The proposed method got precise results for 33 cancer types and Pan-cancer. The CFC framework identified 275 driver genes in Pan-cancer, of which 179 were included in the Gold standard. The identified genes were enriched i n t he principal cancer signaling pathways.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call