Fit-for-purpose curated database application in mass spectrometry-based targeted protein identification and validation.

Keding Cheng, Shawn Babiuk, Angela Sloan, Stuart McCorrister, Gehua Wang, Timothy R Bowden, J David Knox

doi:10.1186/1756-0500-7-444

Abstract

Background: Mass spectrometry (MS) is a very sensitive and specific method for protein identification, biomarker discovery, and biomarker validation. Protein identification is commonly carried out by comparing MS data with public databases. However, with the development of high-throughput and accurate genomic sequencing technology, public databases are being overwhelmed with new entries from different species every day. The application of these databases can also be problematic due to factors such as size, specificity, and unharmonized annotation of the molecules of interest. Current databases used for liquid chromatography-tandem mass spectrometry (LC-MS/MS)-based searches focus on enzyme digestion patterns and sequence information; consequently, important functional information can be missed in the search output. Protein variants with high sequence homology can interfere with database identification when only certain homologues are examined. In addition, recombinant DNA technology can result in products that may not be accurately annotated in public databases. Curated databases, which focus on the molecule of interest with clearer functional annotation and sequence information, are necessary for accurate protein identification and validation. Here, four cases of curated database application have been explored and summarized.

Findings: The four presented curated databases were constructed with clear goals regarding application and have proven very useful for targeted protein identification and biomarker application in different fields. They include a sheeppox virus database created for accurate identification of proteins with strong antigenicity, a custom database containing clearly annotated protein variants such as tau transcript variant 2 for accurate biomarker identification, a sheep-hamster chimeric prion protein (PrP) database constructed for assay development of prion diseases, and a custom Escherichia coli (E. coli) flagella (H antigen) database produced for MS-H, a new H-typing technique. Clearly annotating the proteins of interest was essential for highly accurate, specific, and sensitive sequence identification. Searching against public databases resulted in inaccurate identification of the sequence of interest, while combining the curated database with a public database reduced both the confidence and the sequence coverage of the protein search.

Conclusion: Curated protein sequence databases incorporating clear annotations are very useful for accurate protein identification and fit-for-purpose application through MS-based biomarker validation.
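To make the idea of a curated, clearly annotated database concrete, the sketch below writes a few entries to FASTA text with descriptive header lines and computes the sequence coverage of a protein from a list of identified peptides. It is only an illustration under stated assumptions, not the authors' workflow: the accessions, annotations, and sequences are hypothetical toy values, and `Entry`, `to_fasta`, and `sequence_coverage` are names invented for this example.

```python
from dataclasses import dataclass


@dataclass
class Entry:
    accession: str   # hypothetical identifier, not a real database accession
    annotation: str  # free-text functional annotation shown in the FASTA header
    sequence: str    # amino acid sequence (toy placeholder here)


def to_fasta(entries, width=60):
    """Render entries as FASTA text, one annotated header line per entry."""
    lines = []
    for e in entries:
        lines.append(f">{e.accession} {e.annotation}")
        # Wrap the sequence at a fixed line width.
        for i in range(0, len(e.sequence), width):
            lines.append(e.sequence[i:i + width])
    return "\n".join(lines) + "\n"


def sequence_coverage(protein, peptides):
    """Fraction of protein residues covered by at least one matched peptide."""
    if not protein:
        return 0.0
    covered = [False] * len(protein)
    for pep in peptides:
        if not pep:
            continue
        start = protein.find(pep)
        while start != -1:
            for i in range(start, start + len(pep)):
                covered[i] = True
            start = protein.find(pep, start + 1)
    return sum(covered) / len(protein)


if __name__ == "__main__":
    # Toy entries only; the sequences are placeholders, not real proteins.
    curated = [
        Entry("CUR0001", "toy variant A, isoform of interest", "MKTAYIAKQRQISFVKSHFSRQ"),
        Entry("CUR0002", "toy variant B, close homologue", "MKTAYLAKQRQISFVKSHFSRQ"),
    ]
    print(to_fasta(curated), end="")
    print(sequence_coverage(curated[0].sequence, ["TAYIAK", "QISFVK"]))
```

Keeping the annotation in the header line means that a search hit reports which variant or construct was matched rather than only an accession. In the toy data above, the peptide `TAYIAK` occurs only in the first entry, which illustrates why closely related homologues need to be listed and labelled separately.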


Journal: BMC Research Notes	Publication Date: Jul 10, 2014
Citations: 24	License type: cc-by

