Abstract

BackgroundIn the recent years, there has been a rise in gene expression profiling reports. Unfortunately, it has not been possible to make maximum use of available gene expression data. Many databases and programs can be used to derive the possible expression patterns of mammalian genes, based on existing data. However, these available resources have limitations. For example, it is not possible to obtain a list of genes that are expressed in certain conditions. To overcome such limitations, we have taken up a new strategy to predict gene expression patterns using available information, for one tissue at a time.ResultsThe first step of this approach involved manual collection of maximum data derived from large-scale (genome-wide) gene expression studies, pertaining to mammalian testis. These data have been compiled into a Mammalian Gene Expression Testis-database (MGEx-Tdb). This process resulted in a richer collection of gene expression data compared to other databases/resources, for multiple testicular conditions. The gene-lists collected this way in turn were exploited to derive a 'consensus' expression status for each gene, across studies. The expression information obtained from the newly developed database mostly agreed with results from multiple small-scale studies on selected genes. A comparative analysis showed that MGEx-Tdb can retrieve the gene expression information more efficiently than other commonly used databases. It has the ability to provide a clear expression status (transcribed or dormant) for most genes, in the testis tissue, under several specific physiological/experimental conditions and/or cell-types.ConclusionsManual compilation of gene expression data, which can be a painstaking process, followed by a consensus expression status determination for specific locations and conditions, can be a reliable way of making use of the existing data to predict gene expression patterns. MGEx-Tdb provides expression information for 14 different combinations of specific locations and conditions in humans (25,158 genes), 79 in mice (22,919 genes) and 23 in rats (14,108 genes). It is also the first system that can predict expression of genes with a 'reliability-score', which is calculated based on the extent of agreements and contradictions across gene-sets/studies. This new platform is publicly available at the following web address: http://resource.ibab.ac.in/MGEx-Tdb/

Highlights

  • In the recent years, there has been a rise in gene expression profiling reports

  • The genes were derived from 769 gene-sets, which in turn were collected from online resources like ArrayExpress, GEO as well as, from publications

  • While cancer-related mass scale gene expression profiling have almost been exclusive to humans, studies on hormone treatment and gene knock-outs were more common in mice

Read more

Summary

Introduction

There has been a rise in gene expression profiling reports. it has not been possible to make maximum use of available gene expression data. The main types of data considered by such databases include: Expressed Sequence Tags (ESTs) (e.g., UniGene [1]), microarray (e.g., BioGPS, earlier SymAtlas [2]), Serial Analysis of Gene Expression (SAGE) (e.g., GermSAGE [3]) and manually curated information from different small scale experiments (e.g., Human Protein Reference Database or HPRD [4]). The majority of these databases can extract and display the expression-related data for genes in multiple species, tissues and conditions. The microarray data corresponding to some studies such as the effects of androgen deprivation [12], FSH treatment [13], and testicular carcinoma [14] were not found in most of the resources

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.