Abstract
<p>Background: Genome-wide expression data of gene microarrays can be used to infer gene networks. At a cellular level, a gene network provides a picture of the modules in which genes are densely connected, and of the hub genes, which are highly connected with other genes. A gene network is useful to identify the genes involved in the same pathway, in a protein complex or that are co-regulated. In this study, we used different methods to find gene networks in the ciliate Tetrahymena thermophila, and describe some important properties of this network, such as modules and hubs. Methodology/Principal Findings: Using 67 single channel microarrays, we constructed the Tetrahymena gene network (TGN) using three methods: the Pearson correlation coefficient (PCC), the Spearman correlation coefficient (SCC) and the context likelihood of relatedness (CLR) algorithm. The accuracy and coverage of the three networks were evaluated using four conserved protein complexes in yeast. The CLR network with a Z-score threshold 3.49 was determined to be the most robust. The TGN was partitioned, and 55 modules were found. In addition, analysis of the arbitrarily determined 1200 hubs showed that these hubs could be sorted into six groups according to their expression profiles. We also investigated human disease orthologs in Tetrahymena that are missing in yeast and provide evidence indicating that some of these are involved in the same process in Tetrahymena as in human. Conclusions/Significance: This study constructed a Tetrahymena gene network, provided new insights to the properties of this biological network, and presents an important resource to study Tetrahymena genes at the pathway level. </p>
Highlights
Analysis of the T. thermophila macronuclear genome sequence has identified 58 Tetrahymena orthologs of human disease genes that are missing in yeast [19], and we focused our analysis on these genes
The correlation coefficient was used as the cutoff value for Pearson and Spearman correlation methods, and the Z-score was used for the context likelihood of relatedness (CLR) method
For the two correlation methods, the edge number for the Pearson method was greater than the Spearman method with the same accuracy, suggesting a higher false positive rate for the Pearson correlation coefficient (PCC) method
Summary
High throughput gene expression data as generated by DNA microarray technology provides insight into the behavior of individual genes under various conditions [1]. Network analysis can be used to identify related biological processes or pathways at the cellular level, which are manifested in the form of modules in the gene network. The hub that represents the genes highly connected with others in a network, is an important property of a scale free network and is of great biological significance [5]. Genome-wide expression data of gene microarrays can be used to infer gene networks. A gene network provides a picture of the modules in which genes are densely connected, and of the hub genes, which are highly connected with other genes. We used different methods to find gene networks in the ciliate Tetrahymena thermophila, and describe some important properties of this network, such as modules and hubs
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.