GENCODE 2025: reference gene annotation for human and mouse.

Jonathan M Mudge,Sílvia Carbonell-Sala,Mark Diekhans,Jose Gonzalez Martinez,Toby Hunt,Irwin Jungreis,Jane E Loveland,Carme Arnan,If Barnes,Ruth Bennett,Andrew Berry,Alexandra Bignell,Daniel Cerdán-Vélez,Kelly Cochran,Lucas T Cortés,Claire Davidson,Sarah Donaldson,Cagatay Dursun,Reham Fatima,Matthew Hardy,Prajna Hebbar,Zoe Hollis,Benjamin T James,Yunzhe Jiang,Rory Johnson,Gazaldeep Kaur,Mike Kay,Riley J Mangan,Miguel Maquedano,Laura Martínez Gómez,Nourhen Mathlouthi,Ryan Merritt,Pengyu Ni,Emilio Palumbo,Tamara Perteghella,Fernando Pozo,Shriya Raj,Cristina Sisu,Emily Steed,Dulika Sumathipala,Marie-Marthe Suner,Barbara Uszczynska-Ratajczak,Elizabeth Wass,Yucheng T Yang,Dingyao Zhang,Robert D Finn,Mark Gerstein,Roderic Guigó,Tim J P Hubbard,Manolis Kellis,Anshul Kundaje,Benedict Paten,Michael L Tress,Ewan Birney,Fergal J Martin,Adam Frankish

doi:10.1093/nar/gkae1078

Jonathan M Mudge, Sílvia Carbonell-Sala + Show 54 more

Open Access

PDF Available

https://doi.org/10.1093/nar/gkae1078

Copy DOI

Export

Save

Cite

Journal: Nucleic acids research	Publication Date: Nov 20, 2024
Citations: 1	License type: CC BY 4.0

Abstract
Full-Text PDF
Similar Papers

Abstract

Listen

GENCODE produces comprehensive reference gene annotation for human and mouse. Entering its twentieth year, the project remains highly active as new technologies and methodologies allow us to catalog the genome at ever-increasing granularity. In particular, long-read transcriptome sequencing enables us to identify large numbers of missing transcripts and to substantially improve existing models, and our long non-coding RNA catalogs have undergone a dramatic expansion and reconfiguration as a result. Meanwhile, we are incorporating data from state-of-the-art proteomics and Ribo-seq experiments to fine-tune our annotation of translated sequences, while further insights into function can be gained from multi-genome alignments that grow richer as more species' genomes are sequenced. Such methodologies are combined into a fully integrated annotation workflow. However, the increasing complexity of our resources can present usability challenges, and we are resolving these with the creation of filtered genesets such as MANE Select and GENCODE Primary. The next challenge is to propagate annotations throughout multiple human and mouse genomes, as we enter the pangenome era. Our resources are freely available at our web portal www.gencodegenes.org, and via the Ensembl and UCSC genome browsers.

Full Text