Abstract

The analysis of biological data is a challenging problem in bioinformatics and data mining field. Given the complexity of the analysis of biological information, several methods have been proposed for analyzing this biological information in databases mostly in the form of genetic sequences and protein structures. Actually, genetic sequences are represented by matrices that indicate the expression levels of thousands of genes under several conditions. The analysis of this huge amount of data consists in extracting genes that behave similarly under certain conditions. In fact, the extracted information are sub-matrices (biclusters) that satisfy a coherence constraint. The process of extracting them is called biclustering. In this paper, we deal with biclustering problems applied to the analysis of biological data. First, a description of the problem is reviewed. Furthermore, we present a description of the divide and conquer approach that we will adopt to our algorithm for extracting biclusters. Additionally, a new evaluation function intitled Pattern Correlation Value (PCV), allowing identification of all biclusters types is proposed. Experimental results, demonstrate that the proposed methods are effective on this problem and are able to extract relevant information from the considered data.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.