Evaluation of top-down mass spectral identification with homologous protein sequences

Ziwei Li,Bo He,Qiang Kou,Weixing Feng,Si Wu,Yunlong Liu,Xiaowen Liu,Zhe Wang

doi:10.1186/s12859-018-2462-1

Abstract

BackgroundTop-down mass spectrometry has unique advantages in identifying proteoforms with multiple post-translational modifications and/or unknown alterations. Most software tools in this area search top-down mass spectra against a protein sequence database for proteoform identification. When the species studied in a mass spectrometry experiment lacks its proteome sequence database, a homologous protein sequence database can be used for proteoform identification. The accuracy of homologous protein sequences affects the sensitivity of proteoform identification and the accuracy of mass shift localization.ResultsWe tested TopPIC, a commonly used software tool for top-down mass spectral identification, on a top-down mass spectrometry data set of Escherichia coli K12 MG1655, and evaluated its performance using an Escherichia coli K12 MG1655 proteome database and a homologous protein database. The number of identified spectra with the homologous database was about half of that with the Escherichia coli K12 MG1655 database. We also tested TopPIC on a top-down mass spectrometry data set of human MCF-7 cells and obtained similar results.ConclusionsExperimental results demonstrated that TopPIC is capable of identifying many proteoform spectrum matches and localizing unknown alterations using homologous protein sequences containing no more than 2 mutations.

Highlights

Top-down mass spectrometry has unique advantages in identifying proteoforms with multiple post-translational modifications and/or unknown alterations
We present a method for proteoform identification by top-down mass spectrometry (MS) using homologous protein sequences when the species being studied lacks a proteome database
Data sets Two top-down Tandem mass spectrometry (MS/MS) data sets were used to evaluate the performance of TopPIC and how mutations in database protein sequences affect the sensitivity and accuracy of proteoform identification: the first was from Escherichia coli (EC) and the second was from MCF-7 cells

Summary

Introduction

Top-down mass spectrometry has unique advantages in identifying proteoforms with multiple post-translational modifications and/or unknown alterations. When the species studied in a mass spectrometry experiment lacks its proteome sequence database, a homologous protein sequence database can be used for proteoform identification. In the past two decades, the dominant technology in proteomics studies is bottom-up MS, in which long proteins are proteolytically digested in sample preparation, Database search is routinely used for spectral identification by top-down tandem mass spectrometry (MS/MS). In this approach, experimental MS/MS spectra are searched against theoretical spectra generated from database protein sequences to find high scoring proteoform spectrum matches (PrSMs). A top-down MS/MS spectrum is elusive to identify by database search if the proteoform that produced it contains many alterations compared with the database sequence

Methods

Results

Discussion

Conclusion

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Dec 1, 2018
Citations: 4	License type: open-access

R Discovery Prime

R Discovery Prime

Evaluation of top-down mass spectral identification with homologous protein sequences

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

A graph-based filtering method for top-down mass spectral identification
Runmin Yang ... Daming Zhu
BMC Genomics | VOL. 19
Runmin Yang, et. al.Runmin Yang ... Daming Zhu
01 Sep 2018
BMC Genomics | VOL. 19

Improving Proteoform Identifications in Complex Systems Through Integration of Bottom-Up and Top-Down Data.
Leah V Schaffer ... Robert J Millikin
Journal of Proteome Research | VOL. 19
Leah V Schaffer, et. al.Leah V Schaffer ... Robert J Millikin
25 Jun 2020
Journal of Proteome Research | VOL. 19

Structure-Function, Stability, and Chemical Modification of the Cyanobacterial Cytochrome b6f Complex from Nostoc sp. PCC 7120
Danas Baniulis ... William A Cramer
Journal of Biological Chemistry | VOL. 284
Danas Baniulis, et. al.Danas Baniulis ... William A Cramer
01 Apr 2009
Journal of Biological Chemistry | VOL. 284

The significant conservative and variable regions of the homologous protein sequences.
Pavel V Kostetsky ... Rimma R Vladimirova
Journal of biomolecular structure & dynamics | VOL. 9
Pavel V Kostetsky, et. al.Pavel V Kostetsky ... Rimma R Vladimirova
01 Jun 1992
Journal of biomolecular structure & dynamics | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Evaluation of top-down mass spectral identification with homologous protein sequences

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics