Abstract

BackgroundThe classification of samples on a molecular level has manifold applications, from patient classification regarding cancer treatment to phylogenetics for identifying evolutionary relationships between species. Modern methods employ the alignment of DNA or amino acid sequences, mostly not genome-wide but only on selected parts of the genome. Recently proteomics-based approaches have become popular. An established method for the identification of peptides and proteins is liquid chromatography-tandem mass spectrometry (LC-MS/MS). First, protein sequences from MS/MS spectra are identified by means of database searches, given samples with known genome-wide sequence information, then sequence based methods are applied. Alternatively, de novo peptide sequencing algorithms annotate MS/MS spectra and deduce peptide/protein information without a database. A newer approach independent of additional information is to directly compare unidentified tandem mass spectra. The challenge then is to compute the distance between pairwise MS/MS runs consisting of thousands of spectra.MethodsWe present DISMS2, a new algorithm to calculate proteome-wide distances directly from MS/MS data, extending the algorithm compareMS2, an approach that also uses a spectral comparison pipeline.ResultsOur new more flexible algorithm, DISMS2, allows for the choice of the spectrum distance measure and includes different spectra preprocessing and filtering steps that can be tailored to specific situations by parameter optimization.ConclusionsDISMS2 performs well for samples from species with and without database annotation and thus has clear advantages over methods that are purely based on database search.

Highlights

  • The classification of samples on a molecular level has manifold applications, from patient classification regarding cancer treatment to phylogenetics for identifying evolutionary relationships between species

  • Samples and LC-Mass spectrometry (MS)/MS analysis Proteolytic digests of five sequenced organisms, i.e. (i) human (Homo sapiens, H, HeLa cell line), (ii) mouse (Mus musculus, M, C2C12 cell line), (iii) yeast (Saccharomyces cerevisiae, Y), (iv) roundworm (Caenorhabditis elegans, C), and (v) fruit fly (Drosophila melanogaster, D) and of four organisms without sequenced genome, i.e. (vi) fresh water snail Radix species: molecular operational taxonomic unit (MOTU) 2 (R2), 4 (R4) and (vii) foraminifera species Amphistegina lessonii (Al), Amphistegina gibbosa (Ag) were analyzed using an Ultimate 3000 nano RSLC system coupled to a Q Exactive HF mass spectrometer

  • DISMS2 is a new user-friendly algorithm implemented in R for the proteome-wide distance calculation of different Tandem mass spectrometry (MS/MS) runs

Read more

Summary

Introduction

The classification of samples on a molecular level has manifold applications, from patient classification regarding cancer treatment to phylogenetics for identifying evolutionary relationships between species. An established method for the identification of peptides and proteins is liquid chromatography-tandem mass spectrometry (LC-MS/MS). Protein sequences from MS/MS spectra are identified by means of database searches, given samples with known genome-wide sequence information, sequence based methods are applied. A newer approach independent of additional information is to directly compare unidentified tandem mass spectra. Qualitative and quantitative proteomics strategies are useful to analyze samples measured under different conditions or samples from different phenotypes. Another application was presented by Palmblad and Deelder [3] who reconstructed. Dealing with the high complexity of protein or peptide samples, liquid chromatography as separation technique is often combined with mass spectrometry. MS/MS spectra comprise of detected intensities of occurring masses corresponding to peptide fragments

Methods
Results
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.