Deep groundwaters are among the most energy and nutrient-limited ecosystems on the planet. The limited resources are mainly due to the absence of photosynthesis-driven primary production (Kadnikov et al. 2020). These ecosystems do however host phylogenetically diverse and metabolically active microorganisms from all domains of life plus viruses (Holmfeldt et al. 2021, Mehrshad et al. 2021). In this study, we used a large metagenomic dataset generated over the last eight years from the Äspö Hard Rock Laboratory (Äspö HRL) in Sweden and drill holes in Olkiluoto Island, Finland. This dataset, termed the “Fennoscandian Shield Genomic Dataset” (FSGD), contains metagenome-assembled genomes (MAGs) and single-cell amplified genomes (SAGs). Previous studies on this dataset have shown that reciprocal symbiotic partnerships and efficiency of energy metabolism define the core microbiome of these deep groundwaters (Mehrshad et al. 2021). Studies on different marine and freshwater ecosystems show that oligotrophic environments host streamlined genomes with lower GC content. However, it is not known how the low carbon and energy availability in deep groundwaters affect the microbial community regarding their genome size and GC content. To address this, we used the FSDG to study the distribution of genome size and GC content among bacterial and archaeal genomes in Fennoscandian Shield deep groundwaters. We further disentangled the prevalent metabolic strategies in these genomes that is being used to support their carbon and nitrogen demands for replication and survival. A total of 1990 MAGs/SAGs with a completeness of ≥50% and <5% contamination were recovered from 43 metagenomic datasets. The taxonomy of the MAGs/SAGs was assigned using the GTDB-tk (Chaumeil et al. 2022) and the GC content and genome size of MAGs/SAGs were calculated. MAGs/SAGs were also functionally annotated to investigate the genome-encoded functional potential. To survey the preference for different metabolic pathways and metabolic cross-feeding, the C-fixation pathways were subdivided into 82 routes that lead to the production of intermediate compounds (e.g., formate, pyruvate, oxaloacetate, etc.). A similar analysis for nitrogen acquisition pathways and CO2-capturing enzymes was performed and the presence/absence of these metabolic modules was investigated in representative MAGs/SAGs from various deep groundwater types to explore the microbial community’s metabolic interaction. The GC content and estimated genome size (EGS) of the recovered MAGs/SAGs were linearly correlated, suggesting that higher genome-level GC content is associated with larger genome size. The most common taxa among high GC (≥50%) content MAGs/SAGs were affiliated with the phyla Proteobacteria, Desulfobacterota, Actinobacteriota, Chloroflexota, and Patescibacteria. Analysis of metabolic modules in the environmental context revealed that high GC content MAGs constituted the main primary producers in all investigated groundwaters. Among the seven investigated C-fixation pathways, routes leading to the production of intermediates of reductive tricarboxylic acid (rTCA) were significantly enriched in high GC-content genomes. Both carboxylation/reduction first CO2-capturing enzymes were significantly enriched in MAGs with high GC contents. Of the various carbon fixation strategies, rGly and rTCA are the most energy-efficient and consume one and three ATP molecules, respectively (Bar-Even et al. 2011), matching the high prevalence of rTCA in oligotrophic deep groundwater. High GC content genomes featured metabolic pathways to generate key precursors for synthesizing purine/pyrimidine and amino acids. As GC base-pairs require one more nitrogen than AT base-pairs, prokaryotes with a higher GC content need proportionally more nitrogen for genome replication. In agreement with this, high GC content MAGs/SAGs from the FSGD feature additional nitrogen uptake systems including ammonium permease and N-fixation genes. High GC content MAGs/SAGs appear to be metabolically versatile and capable of acquiring nutrients via different carbon and nitrogen fixation pathways along with various C-scavenging strategies for substrate uptake to acquire energy and survive in oligotrophic conditions. Although their larger genome size and higher GC content entail higher expenses for replication and maintenance, their presence within the community of deep groundwater is supported by the “Black Queen Theory”. According to this theory, a cell containing an expensive metabolic pathway is maintained within the community as it provides a service that other members cannot.
Read full abstract