Unraveling the complex structure and functioning of microbial communities is essential to accurately predict the impact of perturbations and/or environmental changes. From all molecular tools available today to resolve the dynamics of microbial communities, metaproteomics stands out, allowing the establishment of phenotype-genotype linkages.Despite its rapid development, this technology has faced many technical challenges that still hamper its potential power. How to maximize the number of protein identification, improve quality of protein annotation and provide reliable ecological interpretation, are questions of immediate urgency. In our study, we used a robust metaproteomic workflow combining two protein fractionation approaches (gel-based versus gel-free) and four protein search databases derived from the same metagenome to analyze the same seawater sample. The resulting eight metaproteomes provided different outcomes in terms of (i) total protein numbers, (ii) taxonomic structures, and (iii) protein functions. The characterization and/or representativeness of numerous proteins from ecologically relevant taxa such as Pelagibacterales, Rhodobacterales and Synechococcales, as well as crucial environmental processes, such as nutrient uptake, nitrogen assimilation, light harvesting and oxidative stress response were found to be particularly affected by the methodology. Our results provide clear evidences that the use of different protein search databases significantly alters the biological conclusions in both gel-free and gel-based approaches. Our findings emphasize the importance of diversifying the experimental workflow for a comprehensive metaproteomic study.
BackgroundMetaproteomics aims at characterizing the total proteins obtained from microbial communities [1] and, in association with metagenomics, unraveling the functional complexity of a given ecosystem [2]. Since the first environmental metaproteomic study performed in the Chesapeake Bay [3], numerous investigations were carried out in a variety of environments using descriptive, comparative and/or quantitative approaches [4]. Comparative metaproteomics was often used to describe spatial and seasonal changes in aquatic ecosystems using (i) in situ [5][6][7][8] (ii) mesocosms [9, 10] or (iii) microcosms [11] approaches.Metaproteomics on marine ecosystems is a rapidly expanding field that involves a series of challenging steps and critical decisions in its workflow [4,[12][13][14]. The marine metaproteomic workflow consists mainly of four steps: (i) sampling and protein extraction, (ii) protein separation, (iii) mass spectrometry, and (iv) protein identification/annotation [15]. Until now, standardized experimental protocols are still missing, leading to methodological inconsistencies and data interpretation biases across metaproteomic studies [16][17][18].Protein identification strongly relies on both the quality of experimental mass spectra and the comprehensiveness of the protein search database (DB) [15]. Both gel-based [19] and shotgun gel-free [5,...