RefSeq: an update on mammalian reference sequences

Kim D Pruitt,Kelly M Mcgarvey,Raymond E Tully,Michael R Murphy,Olga Ermolaeva,David Webb,Hanzhen Sun,Jennifer Hart,Shashikant Pujar,Craig Wallin,Paul Kitts,Michael Dicuccio,Andrei Shkeda,Lillian D Riddick,Wendy Wu,Bhanu Rajput,Susan M Hiatt,Alexander Astashyn,Sanjida H Rangwala ,Garth Brown ,Melissa Landrum ,Catherine Farrell ,Donna Maglott ,Nuala O’leary ,Terence Murphy ,Janet A Weber ,Pamela A Tamez ,Françoise Thibaud‐Nissen ,James Ostell

doi:10.1093/nar/gkt1114

Abstract

The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of annotated genomic, transcript and protein sequence records derived from data in public sequence archives and from computation, curation and collaboration (http://www.ncbi.nlm.nih.gov/refseq/). We report here on growth of the mammalian and human subsets, changes to NCBI’s eukaryotic annotation pipeline and modifications affecting transcript and protein records. Recent changes to NCBI’s eukaryotic genome annotation pipeline provide higher throughput, and the addition of RNAseq data to the pipeline results in a significant expansion of the number of transcripts and novel exons annotated on mammalian RefSeq genomes. Recent annotation changes include reporting supporting evidence for transcript records, modification of exon feature annotation and the addition of a structured report of gene and sequence attributes of biological interest. We also describe a revised protein annotation policy for alternatively spliced transcripts with more divergent predicted proteins and we summarize the current status of the RefSeqGene project.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nucleic Acids Research	Publication Date: Nov 19, 2013
Citations: 923	License type: cc-by-nc

R Discovery Prime

R Discovery Prime

RefSeq: an update on mammalian reference sequences

Abstract

Talk to us

Similar Papers

More From: Nucleic Acids Research

Lead the way for us

Similar Papers

Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation.
...
Nucleic Acids Research | VOL. 44
, et. al. ...
08 Nov 2015
Nucleic Acids Research | VOL. 44

NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy
K D Pruitt ... G R Brown
Nucleic Acids Research | VOL. 40
K D Pruitt, et. al.K D Pruitt ... G R Brown
24 Nov 2011
Nucleic Acids Research | VOL. 40

NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins
K D Pruitt
Nucleic Acids Research | VOL. 33
K D PruittK D Pruitt
17 Dec 2004
Nucleic Acids Research | VOL. 33

EcoGene-RefSeq: EcoGene tools applied to the RefSeq prokaryotic genomes
Jindan Zhou ... Andrew J Richardson
Bioinformatics | VOL. 29
Jindan Zhou, et. al.Jindan Zhou ... Andrew J Richardson
04 Jun 2013
Bioinformatics | VOL. 29

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

RefSeq: an update on mammalian reference sequences

Abstract

Talk to us

Similar Papers

More From: Nucleic Acids Research