Progress in molecular methods has enabled the monitoring of bacterial populations in time. Nevertheless, understanding community dynamics and its links with ecosystem functioning remains challenging due to the tremendous diversity of microorganisms. Conceptual frameworks that make sense of time series of taxonomically rich bacterial communities, regarding their potential ecological function, are needed. A key concept for organizing ecological functions is the niche, the set of strategies that enable a population to persist and define its impacts on the surroundings. Here we present a framework based on manifold learning to organize genomic information into potentially occupied bacterial metabolic niches over time. Manifold learning tries to uncover low-dimensional data structures in high-dimensional data sets that can be used to describe the data in reduced dimensions. We apply the method to re-construct the dynamics of putatively occupied metabolic niches using a long-term bacterial time series from the Baltic Sea, the Linnaeus Microbial Observatory (LMO). The results reveal a relatively low-dimensional space of occupied metabolic niches comprising groups of taxa with similar functional capabilities. Time patterns of occupied niches were strongly driven by seasonality. Some metabolic niches were dominated by one bacterial taxon, whereas others were occupied by multiple taxa, depending on the season. These results illustrate the power of manifold learning approaches to advance our understanding of the links between community composition and functioning in microbial systems. IMPORTANCE The increase in data availability of bacterial communities highlights the need for conceptual frameworks to advance our understanding of these complex and diverse communities alongside the production of such data. To understand the dynamics of these tremendously diverse communities, we need tools to identify overarching strategies and describe their role and function in the ecosystem in a comprehensive way. Here, we show that a manifold learning approach can coarse grain bacterial communities in terms of their metabolic strategies and that we can thereby quantitatively organize genomic information in terms of potentially occupied niches over time. This approach, therefore, advances our understanding of how fluctuations in bacterial abundances and species composition can relate to ecosystem functions and it can facilitate the analysis, monitoring, and future predictions of the development of microbial communities.
Read full abstract