Abstract
Over the last decades, molecular signatures have attracted extensive attention in cancer research. However, most of the reported biomarkers show a weak distinguishing ability in predicting the survival risks of patients. Actually, univariate analysis is generally considered in regression analysis, which makes the existing statistical methods ineffective. Furthermore, there is too much human involvement in the ways of classifying patients with high and low risk. Last but not least, the participation of therapy after conservative surgery also makes the survival analysis more complex. In order to solve these problems, we propose a solid method of feature selection which combines top-down and bottom-up strategies. The top-down strategy is to randomly extract some genes each time and select candidate genes through cumulative voting. The bottom-up strategy is to fully enumerate the selected genes and to use a clustering algorithm to classify samples. We analyzed glioblastoma data from the Cancer Genome Atlas (TCGA) and got candidate signatures. The results of simulation data, as well as an independent test set the Chinese Glioma Genome Atlas (CGGA), verified the reliability of the method and validity of the selected features.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.