Abstract

The task of biomarker discovery is best translated to the machine learning task of feature ranking. Namely, the goal of biomarker discovery is to identify a set of potentially viable targets for addressing a given biological status. This is aligned with the definition of feature ranking and its goal - to produce a list of features ordered by their importance for the target concept. This differs from the task of feature selection (typically used for biomarker discovery) in that it catches viable biomarkers that have redundant or overlapping information with often highly important biomarkers, while with feature selection this is not the case. We propose to use a methodology for evaluating feature rankings to assess the quality of a given feature ranking and to discover the best cut-off point. We demonstrate the effectiveness of the proposed methodology on 10 datasets containing data about embryonal tumors. We evaluate two most commonly used feature ranking algorithms (Random forests and RReliefF) and using the evaluation methodology identifies a set of viable biomarkers that have been confirmed to be related to cancer.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.