Abstract

Many fatal diseases can spread to multiple parts of the body, so it is important to detect such anomalies early and limit the extent of their spread. Examining the characteristics of genes gives deep insight into disease classification, because genes play a vital role in how an organism appears, behaves and survives in its environment. Abnormal genes can be detected efficiently using statistical methods and machine learning approaches, and gene expression data derived from a microarray can support this computation. Microarray technology, a recent advance in molecular biology, allows DNA samples to be hybridized and the result to be read as numerical values that reflect gene expression levels. We propose selecting a subset of features from the large volume of data retrieved from gene expression profiles using the Boruta feature selection algorithm. A comparative study of several supervised classification algorithms is then carried out to categorize this subset into normal and deviant genes, which serves to identify the most appropriate algorithm for classifying gene expression data. This could make the future identification of abnormal genes faster and easier.
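
The pipeline described in the abstract (shadow-feature selection followed by a comparison of classifiers) can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation. Boruta proper ranks features with random-forest importances and applies a statistical test over many iterations; here a simple standardized mean-difference score and a fixed hit-rate threshold stand in for both. The function names, the toy data, and the two classifiers (nearest centroid and 1-nearest-neighbour) are all assumptions chosen for illustration, since the abstract does not name the algorithms compared.

```python
import math
import random
import statistics


def importance(column, labels):
    """Stand-in importance score (NOT random-forest importance):
    absolute difference of the two class means, scaled by the column spread."""
    a = [v for v, t in zip(column, labels) if t == 0]
    b = [v for v, t in zip(column, labels) if t == 1]
    spread = statistics.pstdev(column) or 1.0
    return abs(statistics.fmean(a) - statistics.fmean(b)) / spread


def shadow_select(X, y, n_rounds=50, min_hit_rate=0.9, seed=0):
    """Keep features that beat the best shuffled 'shadow' copy in most rounds.
    Simplified: a hit-rate threshold replaces Boruta's statistical test."""
    rng = random.Random(seed)
    columns = [list(col) for col in zip(*X)]
    real = [importance(col, y) for col in columns]
    hits = [0] * len(columns)
    for _ in range(n_rounds):
        shadow_max = 0.0
        for col in columns:
            shuffled = col[:]
            rng.shuffle(shuffled)  # shuffling breaks any link to the labels
            shadow_max = max(shadow_max, importance(shuffled, y))
        for j, score in enumerate(real):
            if score > shadow_max:
                hits[j] += 1
    return [j for j, h in enumerate(hits) if h / n_rounds >= min_hit_rate]


def nearest_centroid(train_X, train_y, x):
    """Predict the label whose class centroid is closest to x."""
    best_label, best_dist = None, float("inf")
    for label in set(train_y):
        rows = [r for r, t in zip(train_X, train_y) if t == label]
        centroid = [statistics.fmean(c) for c in zip(*rows)]
        dist = math.dist(centroid, x)
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def one_nearest_neighbour(train_X, train_y, x):
    """Predict the label of the single closest training sample."""
    i = min(range(len(train_X)), key=lambda k: math.dist(train_X[k], x))
    return train_y[i]


def loo_accuracy(classify, X, y):
    """Leave-one-out accuracy of a classifier function."""
    correct = 0
    for i in range(len(X)):
        train_X, train_y = X[:i] + X[i + 1:], y[:i] + y[i + 1:]
        correct += classify(train_X, train_y, X[i]) == y[i]
    return correct / len(X)


# Toy data (hypothetical): 40 samples, 6 "genes"; only genes 0 and 1 carry signal.
rng = random.Random(42)
y = [0] * 20 + [1] * 20
X = [[rng.gauss(3.0 * t, 1.0), rng.gauss(-3.0 * t, 1.0)]
     + [rng.gauss(0.0, 1.0) for _ in range(4)] for t in y]

selected = shadow_select(X, y)
X_sel = [[row[j] for j in selected] for row in X]
scores = {f.__name__: loo_accuracy(f, X_sel, y)
          for f in (nearest_centroid, one_nearest_neighbour)}
print("selected features:", selected)
print("leave-one-out accuracy:", scores)
```

On real microarray data the number of genes far exceeds the number of samples, so an established Boruta implementation and proper cross-validation would be the natural replacements for these stand-ins.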
