Gene: a gene-centered information resource at NCBI.

Garth R Brown,Michael Ovetsky,Olga Ermolaeva,Kenneth S Katz,Vichet Hem,Igor Tolstoy,Craig Wallin,Donna R Maglott,Kim D Pruitt,Tatiana Tatusova,Terence D Murphy

doi:10.1093/nar/gku1055

Garth R Brown, Michael Ovetsky + Show 9 more

Open Access

https://doi.org/10.1093/nar/gku1055

Copy DOI

Abstract

The National Center for Biotechnology Information's (NCBI) Gene database (www.ncbi.nlm.nih.gov/gene) integrates gene-specific information from multiple data sources. NCBI Reference Sequence (RefSeq) genomes for viruses, prokaryotes and eukaryotes are the primary foundation for Gene records in that they form the critical association between sequence and a tracked gene upon which additional functional and descriptive content is anchored. Additional content is integrated based on the genomic location and RefSeq transcript and protein sequence data. The content of a Gene record represents the integration of curation and automated processing from RefSeq, collaborating model organism databases, consortia such as Gene Ontology, and other databases within NCBI. Records in Gene are assigned unique, tracked integers as identifiers. The content (citations, nomenclature, genomic location, gene products and their attributes, phenotypes, sequences, interactions, variation details, maps, expression, homologs, protein domains and external databases) is available via interactive browsing through NCBI's Entrez system, via NCBI's Entrez programming utilities (E-Utilities and Entrez Direct) and for bulk transfer by FTP.

Full Text