Current status and new features of the Consensus Coding Sequence database

Catherine Farrell ,Mark Thomas,Kim D Pruitt,Lillian D Riddick,Craig Wallin,Shashikant Pujar,Stephen M J Searle,Marie-Marthe Suner,Laurens G Wilming,Robert Baertsch,Toby Hunt,Jeena Rajan,Bronwen Aken,David Webb,Mark Diekhans,Daniel Barrell,Jennifer Hart,Tim Hubbard,Charles A Steward,Jose M Gonzalez,Susan M Hiatt,Andrei Shkeda,Bhanu Rajput,Stephen J Trevanion,Wendy Wu,Adam Frankish,Catherine Snow,David Haussler,Kelly M Mcgarvey,Rachel A Harte,Ruth Bennett,Jonathan M Mudge,Michael R Murphy ,James Ostell ,Garth Brown ,Janet A Weber ,Jennifer Harrow ,James Gilbert ,Sanjida H Rangwala ,Jane Loveland ,Pamela A Tamez ,Nuala O’leary ,Mike Kay

doi:10.1093/nar/gkt1059

Abstract

The Consensus Coding Sequence (CCDS) project (http://www.ncbi.nlm.nih.gov/CCDS/) is a collaborative effort to maintain a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assemblies by the National Center for Biotechnology Information (NCBI) and Ensembl genome annotation pipelines. Identical annotations that pass quality assurance tests are tracked with a stable identifier (CCDS ID). Members of the collaboration, who are from NCBI, the Wellcome Trust Sanger Institute and the University of California Santa Cruz, provide coordinated and continuous review of the dataset to ensure high-quality CCDS representations. We describe here the current status and recent growth in the CCDS dataset, as well as recent changes to the CCDS web and FTP sites. These changes include more explicit reporting about the NCBI and Ensembl annotation releases being compared, new search and display options, the addition of biologically descriptive information and our approach to representing genes for which support evidence is incomplete. We also present a summary of recent and future curation targets.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Nucleic Acids Research	Publication Date: Nov 11, 2013
Citations: 142	License type: cc-by-nc

R Discovery Prime

R Discovery Prime

Current status and new features of the Consensus Coding Sequence database

Abstract

Talk to us

Similar Papers

More From: Nucleic Acids Research

Lead the way for us

Similar Papers

Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.
Shashikant Pujar ...
Nucleic Acids Research | VOL. 46
Shashikant Pujar, et. al.Shashikant Pujar ...
06 Nov 2017
Nucleic Acids Research | VOL. 46

Tracking and coordinating an international curation effort for the CCDS Project
R A Harte ... K D Pruitt
Database | VOL. 2012
R A Harte, et. al.R A Harte ... K D Pruitt
20 Mar 2012
Database | VOL. 2012

The consensus coding sequence (CCDS) project: Identifying a common protein-coding gene set for the human and mouse genomes.
Kim D Pruitt ...
Genome Research | VOL. 19
Kim D Pruitt, et. al.Kim D Pruitt ...
04 Jun 2009
Genome Research | VOL. 19

Capturing the Perfect Reference Genome
Andrew S Wiecek
BioTechniques | VOL. 49
Andrew S WiecekAndrew S Wiecek
01 Sep 2010
BioTechniques | VOL. 49

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Current status and new features of the Consensus Coding Sequence database

Abstract

Talk to us

Similar Papers

More From: Nucleic Acids Research