Improved structural annotation of protein-coding genes in the Meloidogyne hapla genome using RNA-Seq.

Yuelong Guo, Dahlia M Nielsen, David McK Bird

doi:10.4161/worm.29158

Abstract

As high-throughput cDNA sequencing (RNA-Seq) is increasingly applied to hypothesis-driven biological studies, the prediction of protein-coding genes based on these data is usurping strictly in silico approaches. Compared with computationally derived gene predictions, structural annotation is more accurate when based on biological evidence, particularly RNA-Seq data. Here, we use RNA-Seq data to refine the current annotation of the Meloidogyne hapla genome. The published structural annotation defines 14 420 protein-coding genes in the M. hapla genome. Of these, 25% (3751) were found to exhibit some incongruence with the RNA-Seq data. Manual annotation enabled these discrepancies to be resolved. Our analysis revealed 544 new gene models that were missing from the prior annotation. Additionally, 1457 transcribed regions were newly identified on the ends of as-yet-unjoined contigs. We also searched for trans-spliced leaders and, based on the RNA-Seq data, identified genes that appear to be trans-spliced. Four 22-bp trans-spliced leaders were identified using our pipeline, including the known trans-spliced leader, which is the M. hapla ortholog of SL1. In silico predictions of trans-splicing were validated by comparison with earlier results derived from an independent cDNA library constructed to capture trans-spliced transcripts. The new annotation, which we term HapPep5, is publicly available at www.hapla.org.
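The abstract does not describe how the leader search was implemented. As a rough illustration of the general idea only, the sketch below tallies the 22-nt 5' prefixes shared by multiple reads; recurring prefixes are candidate leaders. The function name `candidate_leaders`, the `min_reads` threshold, and the toy sequences are all hypothetical, and a real pipeline would presumably also check that the prefix does not align to the genome upstream of the gene.

```python
from collections import Counter

LEADER_LEN = 22  # leader length reported in the abstract


def candidate_leaders(reads, min_reads=2, leader_len=LEADER_LEN):
    """Return (prefix, count) pairs for 5' prefixes shared by >= min_reads reads.

    Illustrative only: this is not the authors' pipeline, and the
    min_reads threshold is an assumed parameter.
    """
    counts = Counter(
        read[:leader_len].upper()
        for read in reads
        if len(read) > leader_len  # skip reads with no sequence past the prefix
    )
    return [(prefix, n) for prefix, n in counts.most_common() if n >= min_reads]


if __name__ == "__main__":
    # Placeholder 22-nt sequence, NOT a real M. hapla leader.
    toy_leader = "ACGT" * 5 + "AC"
    reads = [
        toy_leader + "TTTGGGCCC",
        toy_leader + "GGGAAATTT",
        "T" * 22 + "CCCC",  # prefix seen only once, so it is filtered out
    ]
    print(candidate_leaders(reads))
```

Counting exact prefixes keeps the example short; tolerating sequencing errors or partial leaders would require approximate matching, which is omitted here.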


Journal: Worm	Publication Date: Jan 1, 2014
Citations: 7
