Abstract

DNA microarray technology has emerged as a boon to the scientific community in understanding the growth and development of life as well as in widening their knowledge in exploring the genetic causes of anomalies occurring in the working of the human body. Microarray technology makes biologists be capable of monitoring expression of thousands of genes in a single experiment on a small chip. Extracting useful knowledge and information from these microarrays has attracted the attention of many biologists and computer scientists. Outlier is defined as an observation that deviates too much from other observations. The identification of outliers can lead to the discovery of useful and meaningful knowledge. In this paper we propose hybrid approach for outlier detection by studying the behavior of projections from the data set. We first identify patterns with abnormally low presence. Once such patterns have been identified, then the outliers are defined as those records which have such patterns present in them. We have compared time and quality aspects of our results and it is found to be better than both brute force and evolutionary algorithms. For the same data set, brute force algorithm takes more time with good quality results as compared to evolutionary algorithm, where as Evolutionary Algorithm takes less time with less quality compared to brute force algorithm.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.