Sepsis is a dysfunctional host response to infection. The syndrome leads to millions of deaths annually (19.7% of all deaths in 2017) and is the cause of most deaths from severe Covid infections. High throughput sequencing or 'omics' experiments in molecular and clinical sepsis research have been widely utilized to identify new diagnostics and therapies. Transcriptomics, quantifying gene expression, has dominated these studies, due to the efficiency of measuring gene expression in tissues and the technical accuracy of technologies like RNA-Seq. Most of these studies seek to uncover novel mechanistic insights into sepsis pathogenesis and diagnostic gene signatures by identifying genes differentially expressed between two or more relevant conditions. However, little effort has been made, to date, to aggregate this knowledge from such studies. In this study we sought to build a compendium of previously described gene sets that combines knowledge gained from sepsis-associated studies. This would enable the identification of genes most associated with sepsis pathogenesis, and the description of the molecular pathways commonly associated with sepsis. PubMed was searched for studies using transcriptomics to characterize acute infection/sepsis and severe sepsis (i.e., sepsis combined with organ failure). Several studies were identified that used transcriptomics to identify differentially expressed (DE) genes, predictive/prognostic signatures, and underlying molecular responses and pathways. The molecules included in each gene set were collected, in addition to the relevant study metadata (e.g., patient groups used for comparison, sample collection time point, tissue type, etc.). After performing extensive literature curation of 74 sepsis-related publications involving transcriptomics, 103 unique gene sets (comprising 20,899 unique genes) from thousands of patients were collated together with associated metadata. Frequently described genes included in gene sets as well as the molecular mechanisms they were involved in were identified. These mechanisms included neutrophil degranulation, generation of second messenger molecules, IL-4 and -13 signaling, and IL-10 signaling among many others. The database, which we named SeptiSearch, is made available in a web application created using the Shiny framework in R, (available at https://septisearch.ca). SeptiSearch provides members of the sepsis community the bioinformatic tools needed to leverage and explore the gene sets contained in the database. This will allow the gene sets to be further scrutinized and analyzed for their enrichment in user-submitted gene expression data and used for validation of in-house gene sets/signatures.
Read full abstract