Abstract

The development of microarray-based high-throughput gene profiling has led to the hope that this technology could provide an efficient and accurate means of diagnosing and classifying cancers. However, the large amount of data generated by microarrays requires effective selection of informative genes for cancer classification. Key issue that needs to be addressed is a selection of small number of informative genes that contribute to a disease from the thousands of genes measured on microarrays. This work deals with finding the small subset of informative genes from gene expression microarray data which maximize the classification accuracy. We introduce an improved version of hybrid of genetic algorithm and support vector machine for genes selection and classification. We show that the classification accuracy of the proposed approach is superior to a number of current state-of-the-art methods of one widely used benchmark dataset. The informative genes from the best subset are validated and verified by comparing them with the biological results produced from biology and computer scientist researchers in order to explore the biological plausibility.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.