Integrated analysis of gene expression by Association Rules Discovery.

Pedro Carmona-Saez,Monica Chagoyen,Alberto Pascual-Montano,Andres Rodriguez,Oswaldo Trelles,Jose M Carazo

doi:10.1186/1471-2105-7-54

Pedro Carmona-Saez, Monica Chagoyen + Show 4 more

Open Access

https://doi.org/10.1186/1471-2105-7-54

Copy DOI

Abstract

BackgroundMicroarray technology is generating huge amounts of data about the expression level of thousands of genes, or even whole genomes, across different experimental conditions. To extract biological knowledge, and to fully understand such datasets, it is essential to include external biological information about genes and gene products to the analysis of expression data. However, most of the current approaches to analyze microarray datasets are mainly focused on the analysis of experimental data, and external biological information is incorporated as a posterior process.ResultsIn this study we present a method for the integrative analysis of microarray data based on the Association Rules Discovery data mining technique. The approach integrates gene annotations and expression data to discover intrinsic associations among both data sources based on co-occurrence patterns. We applied the proposed methodology to the analysis of gene expression datasets in which genes were annotated with metabolic pathways, transcriptional regulators and Gene Ontology categories. Automatically extracted associations revealed significant relationships among these gene attributes and expression patterns, where many of them are clearly supported by recently reported work.ConclusionThe integration of external biological information and gene expression data can provide insights about the biological processes associated to gene expression programs. In this paper we show that the proposed methodology is able to integrate multiple gene annotations and expression data in the same analytic framework and extract meaningful associations among heterogeneous sources of data. An implementation of the method is included in the Engene software package.

Highlights

Microarray technology is generating huge amounts of data about the expression level of thousands of genes, or even whole genomes, across different experimental conditions
Current approaches to analyze microarray data in this line are frequently based on the application of clustering algorithms in order to establish sets of co-expressed genes
We applied our approach to the analysis of several gene expression datasets integrating different sources of biological information such as metabolic pathways, Gene Ontology annotations or transcriptional regulators

Summary

Introduction

Microarray technology is generating huge amounts of data about the expression level of thousands of genes, or even whole genomes, across different experimental conditions. DNA microarray technology is a powerful method for exploring biological processes on a genomic scale This high-throughput technique allows researchers to simultaneously monitor the expression level of thousands of genes, or even whole genomes, in a single experiment. Current approaches to analyze microarray data in this line are frequently based on the application of clustering algorithms in order to establish sets of co-expressed genes These algorithms do not incorporate available information about genes and gene products and they just take into account experimental measurements. Each set of co-clustered genes has to be further examined with the aim of discovering common biological connections among them In this way, biological information is incorporated as a subsequent process to the analysis of expression data

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Feb 7, 2006
Citations: 171	License type: cc-by

R Discovery Prime

R Discovery Prime

Integrated analysis of gene expression by Association Rules Discovery.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

Abstract 419: Integrative analysis of multi-omics tumor profiles identifies pathways associated with resistance to anti-HER2 therapy in early stage breast cancer
Salendra Singh ... William Sikov
Cancer Research | VOL. 77
Salendra Singh, et. al.Salendra Singh ... William Sikov
01 Jul 2017
Cancer Research | VOL. 77

A multivariate analysis approach to the integration of proteomic and gene expression data
Ailís Fagan ... Desmond G Higgins
PROTEOMICS | VOL. 7
Ailís Fagan, et. al.Ailís Fagan ... Desmond G Higgins
01 Jun 2007
PROTEOMICS | VOL. 7

Comparative Analysis of Different Label-Free Mass Spectrometry Based Protein Abundance Estimates and Their Correlation with RNA-Seq Gene Expression Data
Kang Ning ... Damian Fermin
Journal of Proteome Research | VOL. 11
Kang Ning, et. al.Kang Ning ... Damian Fermin
29 Feb 2012
Journal of Proteome Research | VOL. 11

Platelet-derived Growth Factor Stimulates Src-dependent mRNA Stabilization of Specific Early Genes in Fibroblasts
Paul A Bromann ... Sara A Courtneidge
Journal of Biological Chemistry | VOL. 280
Paul A Bromann, et. al.Paul A Bromann ... Sara A Courtneidge
01 Mar 2005
Journal of Biological Chemistry | VOL. 280

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Integrated analysis of gene expression by Association Rules Discovery.

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics