Structural and functional-annotation of an equine whole genome oligoarray

Lauren A Bright,Fiona M Mccarthy,Shane C Burgess,Bhanu Chowdhary,Cyprianna E Swiderski

doi:10.1186/1471-2105-10-s11-s8

Lauren A Bright, Fiona M Mccarthy + Show 3 more

Open Access

https://doi.org/10.1186/1471-2105-10-s11-s8

Copy DOI

Abstract

BackgroundThe horse genome is sequenced, allowing equine researchers to use high-throughput functional genomics platforms such as microarrays; next-generation sequencing for gene expression and proteomics. However, for researchers to derive value from these functional genomics datasets, they must be able to model this data in biologically relevant ways; to do so requires that the equine genome be more fully annotated. There are two interrelated types of genomic annotation: structural and functional. Structural annotation is delineating and demarcating the genomic elements (such as genes, promoters, and regulatory elements). Functional annotation is assigning function to structural elements. The Gene Ontology (GO) is the de facto standard for functional annotation, and is routinely used as a basis for modelling and hypothesis testing, large functional genomics datasets.ResultsAn Equine Whole Genome Oligonucleotide (EWGO) array with 21,351 elements was developed at Texas A&M University. This 70-mer oligoarray was designed using the approximately 7× assembled and annotated sequence of the equine genome to be one of the most comprehensive arrays available for expressed equine sequences. To assist researchers in determining the biological meaning of data derived from this array, we have structurally annotated it by mapping the elements to multiple database accessions, including UniProtKB, Entrez Gene, NRPD (Non-Redundant Protein Database) and UniGene. We next provided GO functional annotations for the gene transcripts represented on this array. Overall, we GO annotated 14,531 gene products (68.1% of the gene products represented on the EWGO array) with 57,912 annotations. GAQ (GO Annotation Quality) scores were calculated for this array both before and after we added GO annotation. The additional annotations improved the meanGAQ score 16-fold. This data is publicly available at AgBase http://www.agbase.msstate.edu/.ConclusionProviding additional information about the public databases which link to the gene products represented on the array allows users more flexibility when using gene expression modelling and hypothesis-testing computational tools. Moreover, since different databases provide different types of information, users have access to multiple data sources. In addition, our GO annotation underpins functional modelling for most gene expression analysis tools and enables equine researchers to model large lists of differentially expressed transcripts in biologically relevant ways.

Highlights

The horse genome is sequenced, allowing equine researchers to use highthroughput functional genomics platforms such as microarrays; next-generation sequencing for gene expression and proteomics
Providing additional information about the public databases which link to the gene products represented on the array allows users more flexibility when using gene expression modelling and hypothesis-testing computational tools
Our Gene Ontology (GO) annotation underpins functional modelling for most gene expression analysis tools and enables equine researchers to model large lists of differentially expressed transcripts in biologically relevant ways

Summary

Introduction

The horse genome is sequenced, allowing equine researchers to use highthroughput functional genomics platforms such as microarrays; next-generation sequencing for gene expression and proteomics. For researchers to derive value from these functional genomics datasets, they must be able to model this data in biologically relevant ways; to do so requires that the equine genome be more fully annotated. Genomic annotation includes the demarcation of functional elements within the genomic sequence (“structural annotation”) and associating functional data with these same elements (“functional annotation”). Gov/genome/guide/gnomon.shtml combines ab initio predictions with sequence homology based upon RefSeq transcript alignments of the known genes. This structural annotation pipeline currently identifies 21,842 horse genes, and of these, 82.4% are “predicted” based upon sequence similarity with known genes from other species (as of 10/04/08). This means that these 17,997 horse genes are only listed because they are similar in sequence to genes that are already known to exist in other species

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Oct 1, 2009
Citations: 38	License type: cc-by

R Discovery Prime

R Discovery Prime

Structural and functional-annotation of an equine whole genome oligoarray

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

AgBase: a unified resource for functional analysis in agriculture
F M Mccarthy ... G B Magee
Nucleic Acids Research | VOL. 35
F M Mccarthy, et. al.F M Mccarthy ... G B Magee
29 Nov 2006
Nucleic Acids Research | VOL. 35

Gene Ontology annotation quality analysis in model eukaryotes
Teresia J Buza ... Susan M Bridges
Nucleic Acids Research | VOL. 36
Teresia J Buza, et. al.Teresia J Buza ... Susan M Bridges
10 Jan 2008
Nucleic Acids Research | VOL. 36

Gene Ontology Annotations and Resources

Nucleic Acids Research | VOL. 41

17 Nov 2012
Nucleic Acids Research | VOL. 41

The use of semantic similarity measures for optimally integrating heterogeneous Gene Ontology data from large scale annotation pipelines.
Gaston K Mazandu ... Nicola J Mulder
Frontiers in genetics | VOL. 5
Gaston K Mazandu, et. al.Gaston K Mazandu ... Nicola J Mulder
06 Aug 2014
Frontiers in genetics | VOL. 5

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Structural and functional-annotation of an equine whole genome oligoarray

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics