The Integration of Data from Different Long-Read Sequencing Platforms Enhances Proteoform Characterization in Arabidopsis.

Lara García-Campa,Luis Valledor,Jesús Pascual

doi:10.3390/plants12030511

Lara García-Campa, Luis Valledor + Show 1 more

Open Access

https://doi.org/10.3390/plants12030511

Copy DOI

Journal: Plants (Basel, Switzerland)	Publication Date: Jan 22, 2023
Citations: 2	License type: CC BY 4.0

Affiliation: University of Oviedo

Abstract

The increasing availability of massive omics data requires improving the quality of reference databases and their annotations. The combination of full-length isoform sequencing (Iso-Seq) with short-read transcriptomics and proteomics has been successfully used for increasing proteoform characterization, which is a main ongoing goal in biology. However, the potential of including Oxford Nanopore Technologies Direct RNA Sequencing (ONT-DRS) data has not been explored. In this paper, we analyzed the impact of combining Iso-Seq- and ONT-DRS-derived data on the identification of proteoforms in Arabidopsis MS proteomics data. To this end, we selected a proteomics dataset corresponding to senescent leaves and we performed protein searches using three different protein databases: AtRTD2 and AtRTD3, built from the homonymous transcriptomes, regarded as the most complete and up-to-date available for the species; and a custom hybrid database combining AtRTD3 with publicly available ONT-DRS transcriptomics data generated from Arabidopsis leaves. Our results show that the inclusion and combination of long-read sequencing data from Iso-Seq and ONT-DRS into a proteogenomic workflow enhances proteoform characterization and discovery in bottom-up proteomics studies. This represents a great opportunity to further investigate biological systems at an unprecedented scale, although it brings challenges to current protein searching algorithms.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

The Integration of Data from Different Long-Read Sequencing Platforms Enhances Proteoform Characterization in Arabidopsis.

Abstract

Talk to us

Similar Papers

More From: Plants (Basel, Switzerland)

Lead the way for us

Similar Papers

Long-read RNA sequencing analysis of the lytic human cytomegalovirus transcriptome
Zsolt Balázs
-
Zsolt BalázsZsolt Balázs
05 Sep 2019
05 Sep 2019

Characterization of Proteoforms with Unknown Post-translational Modifications Using the MIScore.
Qiang Kou ... Xiaowen Liu
Journal of Proteome Research | VOL. 15
Qiang Kou, et. al.Qiang Kou ... Xiaowen Liu
01 Jul 2016
Journal of Proteome Research | VOL. 15

Genome sequence of the barred knifejaw Oplegnathus fasciatus (Temminck & Schlegel, 1844): the first chromosome-level draft genome in the family Oplegnathidae.
Yongshuang Xiao ... Daoyuan Ma
GigaScience | VOL. 8
Yongshuang Xiao, et. al.Yongshuang Xiao ... Daoyuan Ma
01 Feb 2019
GigaScience | VOL. 8

TopMSV: A Web-Based Tool for Top-Down Mass Spectrometry Data Visualization.
In Kwon Choi ... Si Wu
Journal of the American Society for Mass Spectrometry | VOL. 32
In Kwon Choi, et. al.In Kwon Choi ... Si Wu
29 Mar 2021
Journal of the American Society for Mass Spectrometry | VOL. 32

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Integration of Data from Different Long-Read Sequencing Platforms Enhances Proteoform Characterization in Arabidopsis.

Abstract

Talk to us

Similar Papers

More From: Plants (Basel, Switzerland)