Abstract

With respect to DNA microarray has produced massive amounts of gene expression data with high dimension in recent years, gene selection is one of the bottlenecks of gene expression datasets analysis. This paper presents a framework for unsupervised gene selection based on dependency maximisation between the samples similarity matrices of before and after deleting a gene, using a novel estimation of the Hilbert-Schmidt independence criterion (HSIC). The key idea is that elimination of genes which are redundant and/or have much relevancy with other genes does not have much effect on pairwise samples similarity. Also, to deal with diagonally dominant matrices, the dynamic range of matrix values is reduced. Additionally, gap statistic and k-means clustering methods are used to increase the speed of proposed methods. Experimental validation is conducted on several microarray gene expression datasets and the results show that our gene selection scheme works well in practice.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.