Abstract
AbstractThe development of MSnbase aims at providing researchers dealing with labelled quantitative proteomics data with a transparent, portable, extensible and open-source collaborative framework to easily manipulate and analyse MS2-level raw tandem mass spectrometry data. The implementation in R gives users and developers a great variety of powerful tools to be used in a controlled and reproducible way. Furthermore, MSnbase has been developed following an object-oriented programming paradigm: all information that is manipulated by the user is encapsulated in ad hoc data containers to hide it's underlying complexity. We illustrate the usage and achievements of our software using a published spiked-in data set in which varying quantities of test proteins have been labelled with four different iTRAQ tags. In addition to providing raw MSMS data, MSnbase also stores meta-data and logs processing steps in the data object itself for optimal traceability. We provide graphics on how to inspect precursor data for quality control and how individual or merged MSMS spectra can subsequently be processed, plotted and extracted using a variety of methods. We also demonstrate how reporter ions (or any peaks of interest defined by the user) can easily be quantified and normalised using several build-in alternative strategies and how the effect of each transformation can be recorded, examined and reproduced. MSnbase constitutes a unique versatile working and development environment to process labelled MSMS data and provides in turn important feedback for data acquisition optimisation. We conclude by presenting future extensions of MSnbase and highlight its usage in reproducible proteomics research.
Highlights
The development of MSnbase aims at providing researchers dealing with labelled quantitative proteomics data with a transparent, portable, extensible and open-source collaborative framework to manipulate and analyse MS2-level tandem mass spectrometry data
We created a new set of reporter ions, including the 4 iTRAQ tags and a virtual tag at MZ 145, which corresponds to partially fragmented reporter tags and balance groups and quantified these 5 peaks
Figure: Four exogenous proteins have been spiked in an Erwinia background and labelled with 4 iTRAQ labels (BSA 1:2:3:4, cytochrome C and phosphorylase B 1:1:1:1, Enolase 4:3:2:1)
Summary
The development of MSnbase aims at providing researchers dealing with labelled quantitative proteomics data with a transparent, portable, extensible and open-source collaborative framework to manipulate and analyse MS2-level tandem mass spectrometry data. MSnbase has been developed following an object-oriented programming paradigm: all information that is manipulated by the user is encapsulated in ad hoc data containers to hide it’s underlying complexity. The implementation in R gives users and developers a great variety of powerful tools to be used in a controlled and reproducible way
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.