Increasing the power of interpretation for soil metaproteomics data

Virginie Jouffret,Sophie Ayrault,Guylaine Miotello,Olivier Pible,Karen Culotta,Jean Armengaud

doi:10.1186/s40168-021-01139-1

Abstract

BackgroundSoil and sediment microorganisms are highly phylogenetically diverse but are currently largely under-represented in public molecular databases. Their functional characterization by means of metaproteomics is usually performed using metagenomic sequences acquired for the same sample. However, such hugely diverse metagenomic datasets are difficult to assemble; in parallel, theoretical proteomes from isolates available in generic databases are of high quality. Both these factors advocate for the use of theoretical proteomes in metaproteomics interpretation pipelines. Here, we examined a number of database construction strategies with a view to increasing the outputs of metaproteomics studies performed on soil samples.ResultsThe number of peptide-spectrum matches was found to be of comparable magnitude when using public or sample-specific metagenomics-derived databases. However, numbers were significantly increased when a combination of both types of information was used in a two-step cascaded search. Our data also indicate that the functional annotation of the metaproteomics dataset can be maximized by using a combination of both types of databases.ConclusionsA two-step strategy combining sample-specific metagenome database and public databases such as the non-redundant NCBI database and a massive soil gene catalog allows maximizing the metaproteomic interpretation both in terms of ratio of assigned spectra and retrieval of function-derived information.4F9sWsX3PjS31f5TKAyCkGVideo abstract

Highlights

Soil and sediment microorganisms are highly phylogenetically diverse but are currently largely underrepresented in public molecular databases
Benchmarking databases created from sample-specific metagenomics data Different databases built from metagenomics data acquired on a sediment sample were evaluated for metaproteomics based on the number of Peptide-to-spectrum matches (PSMs) as main criterion
A soil core was collected from the Seine River floodplain at the Bouafles site (France) located downstream of Paris

Summary

Introduction

Soil and sediment microorganisms are highly phylogenetically diverse but are currently largely underrepresented in public molecular databases Their functional characterization by means of metaproteomics is usually performed using metagenomic sequences acquired for the same sample. Such hugely diverse metagenomic datasets are difficult to assemble; in parallel, theoretical proteomes from isolates available in generic databases are of high quality. Soils are open systems exposed to highly variable environmental parameters such as Jouffret et al Microbiome (2021) 9:195 extraction, metaproteomics methods must be developed to suit each soil type [32, 63] Despite these difficulties, several pioneering studies have been performed on soils extracted from forests [40, 74], arid environments [7], agricultural areas [39, 50], permafrost [27], and from mining drainage [53]. Sediments — deposited material arising from weathering, erosion, and transport processes — contain complex microbial ecosystems [19, 64]

Objectives

Methods

Results

Discussion

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Microbiome	Publication Date: Sep 29, 2021
Citations: 31	License type: open-access

R Discovery Prime

R Discovery Prime

Increasing the power of interpretation for soil metaproteomics data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Microbiome

Lead the way for us

Similar Papers

Enhanced Peptide Identification by Electron Transfer Dissociation Using an Improved Mascot Percolator
James C Wright ... Jyoti S Choudhary
Molecular & Cellular Proteomics | VOL. 11
James C Wright, et. al.James C Wright ... Jyoti S Choudhary
01 Aug 2012
Molecular & Cellular Proteomics | VOL. 11

An efficient ACS algorithm for classification-based peptide identification
Xijun Liang ... Ling Jian
-
Xijun Liang, et. al. Xijun Liang ... Ling Jian
01 Nov 2015
01 Nov 2015

Empirical Multidimensional Space for Scoring Peptide Spectrum Matches in Shotgun Proteomics
Mark V Ivanov ... Mikhail V Gorshkov
Journal of Proteome Research | VOL. 13
Mark V Ivanov, et. al.Mark V Ivanov ... Mikhail V Gorshkov
13 Mar 2014
Journal of Proteome Research | VOL. 13

Alignment of gene expression profiles from test samples against a reference database: New method for context-specific interpretation of microarray data
Sami K Kilpinen ... Olli P Kallioniemi
BioData Mining | VOL. 4
Sami K Kilpinen, et. al.Sami K Kilpinen ... Olli P Kallioniemi
31 Mar 2011
BioData Mining | VOL. 4

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Increasing the power of interpretation for soil metaproteomics data

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Microbiome