The gut microbiome is implicated in the pathogenesis of colorectal cancer (CRC), but the full scope of this dialogue is unknown. Here we aimed to define the scale and membership of the body of CRC- and health-associated gut bacteria in global populations. We performed a microbiome-CRC correlation analysis of published ultra-deep shotgun metagenomic sequencing data from global microbiome surveys, utilizing a de novo (reference-agnostic) gene-level clustering approach to identify protein-coding co-abundant gene (CAGs) clusters. We link an unprecedented ~ 23–40% of gut bacteria to CRC or health, split nearly evenly as CRC- or health-associated. These microbes encode 2319 CAGs encompassing 427,261 bacterial genes significantly enriched or depleted in CRC. We identified many microbes that had not previously been linked to CRC, thus expanding the scope of “known unknowns” of CRC-associated microbes. We performed an agnostic CAG-based screen of bacterial isolates and validated predicted effects of previously unimplicated bacteria in preclinical models, in which we observed differential induction of precancerous adenomas and field effects. Single-cell RNA sequencing disclosed microbiome-induced senescence-associated gene expression signatures in discrete colonic populations including fibroblasts. In organoid co-cultures, primary colon fibroblasts from mice with microbiomes promoted significantly greater growth than fibroblasts from microbiome-depleted mice. These results offer proof-of-principle for gene-level metagenomic analysis enabling discovery of microbiome links to health and demonstrate that the microbiome can drive precancer states, thereby potentially revealing novel cancer prevention opportunities.
Read full abstract