Abstract

Comparing genomic properties of different organisms is of fundamental importance in the study of biological and evolutionary principles. Although differences among organisms are often attributed to differential gene expression, genome-wide comparative analysis thus far has been based primarily on genomic sequence information. We present a comparative study of large datasets of expression profiles from six evolutionarily distant organisms: S. cerevisiae, C. elegans, E. coli, A. thaliana, D. melanogaster, and H. sapiens. We use genomic sequence information to connect these data and compare global and modular properties of the transcription programs. Linking genes whose expression profiles are similar, we find that for all organisms the connectivity distribution follows a power-law, highly connected genes tend to be essential and conserved, and the expression program is highly modular. We reveal the modular structure by decomposing each set of expression data into coexpressed modules. Functionally related sets of genes are frequently coexpressed in multiple organisms. Yet their relative importance to the transcription program and their regulatory relationships vary among organisms. Our results demonstrate the potential of combining sequence and expression data for improving functional gene annotation and expanding our understanding of how gene expression and diversity evolved.

Highlights

  • Microarray experiments are being used to address a large diversity of biological issues

  • Gene expression analysis can provide functional links for new ORFs based on their coexpression with known genes

  • Comparing genomic properties of different organisms is of fundamental importance in the study of biological and evolutionary principles

Read more

Summary

Introduction

As evolutionary changes frequently involve modifications of the gene regulatory program (Carroll 2000; True and Carroll 2002; Wray et al 2003), integration of expression data into interspecies comparative analyses could potentially provide new insights into the relation between genomic sequence and organismal form and function Such an approach has been mostly applied to small numbers of genes (Carroll 2000; True and Carroll 2002; Wray et al 2003) or has been restricted to variations in the genome-wide expression profiles during the development of closely related species (Rifkin et al 2003). With the accumulation of large-scale expression data for a number of diverse species, the time may be ripe for a macroevolutionary comparison of gene expression

Methods
Results
Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.