Abstract

AbstractThe zebrafish genome, which consists of 25 linkage groups and is ~1.4Gb in size, is being sequenced, finished and analysed in its entirety at the Wellcome Trust Sanger Institute. The manual annotation is provided by the Human and Vertebrate Analysis and Annotation (HAVANA) group and is released at regular intervals onto the Vertebrate Genome Annotation (Vega) database ("http://vega.sanger.ac.uk":http://vega.sanger.ac.uk) and may be viewed as a DAS source in Ensembl ("http://www.ensembl.org/Danio_rerio":http://www.ensembl.org/Danio_rerio). Our annotation is compiled in close collaboration with the Zebrafish Information Network (ZFIN) ("http://zfin.org/":http://zfin.org/), which has enabled us to provide an accurate, dynamic and distinct resource for the zebrafish community as a whole.Annotation is based on the reference genome sequence, which is derived from a minimal tile path assembly composed of clones that have been mapped, sequenced and meticulously finished to a sequence accuracy of over 99.9% per 100Kb. We expect to have 90% of the zebrafish genome to a finished standard by the end of 2009. Our approach to annotation uses two strategies. Firstly, the generation and annotation of gene lists comprising of cDNA (8995 in total) found in ZFIN that maps to our current reference assembly. And, secondly, by using clone by clone annotation, where we have annotated over 3200 genes, 1100 transcripts and 130 pseudogenes across 11 linkage groups and 3530 clones. As well as our on-going genome annotation we also welcome external annotation requests for specific genes and regions, which already include the annotation of 93 genes associated with human obesity and the scheduled annotation of the Major Histocompatability Complex, which will utilise reference sequence taken from libraries of a double haploid fish and complement our previous work on the human and mouse MHC already published. External requests and any feedback, questions or requests can be sent to zfish-help [at] sanger.ac.uk.

Highlights

  • J P Almeida-King, S Donaldson, G K Laird, D M Lloyd, H K Sehra, J E Collins, K Howe, B Reimholz, J Torrance, S Trevanion, D Stemple, J G R Gilbert, E Griffiths, J E Loveland, R Storey, J L Harrow, T Hubbard

  • The zebrafish genome, which consists of 25 linkage groups and is ~1.4Gb in size, is being sequenced, finished and analysed in its entirety at the Wellcome Trust Sanger Institute to provide an open source, high quality reference genome

  • The manual annotation, which is compiled in close collaboration with the Zebrafish Information Network, is provided by the Human and Vertebrate Analysis and Annotation (HAVANA) group and is frequently released onto the Vertebrate Genome Annotation (Vega) database and may be viewed as a DAS source in Ensembl

Read more

Summary

Providing Manual Annotation for the Scientific Community

J P Almeida-King, S Donaldson, G K Laird, D M Lloyd, H K Sehra, J E Collins, K Howe, B Reimholz, J Torrance, S Trevanion, D Stemple, J G R Gilbert, E Griffiths, J E Loveland, R Storey, J L Harrow, T Hubbard. The zebrafish genome, which consists of 25 linkage groups and is ~1.4Gb in size, is being sequenced, finished and analysed in its entirety at the Wellcome Trust Sanger Institute to provide an open source, high quality reference genome. The manual annotation, which is compiled in close collaboration with the Zebrafish Information Network (http://zfin.org/), is provided by the Human and Vertebrate Analysis and Annotation (HAVANA) group and is frequently released onto the Vertebrate Genome Annotation (Vega) database (http://vega.sanger.ac.uk) and may be viewed as a DAS source in Ensembl (http://www.ensembl.org/Danio_rerio). Annotation is based on the reference genome assembly sequence, which is derived from a minimal tile path composed of clones finished to a 99.9% sequence standard. The generation of gene lists composed of ZFIN cDNA that map to our current finished assembly. The latest zebrafish assembly (Zv8), which represents the most accurate assembly to date is available in Pre-Ensembl (http://pre.ensembl.org/Danio_rerio/Info/Index)

The ZMAP Annotation Interface
Building Genes Using Solexa Data
Community Directed Annotation
Solexa gene models

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.