Distinguishing potential bacteria-tumor associations from contamination in a secondary data analysis of public cancer genome sequence data

Kelly M Robinson, John S A Mattick, Jonathan Crabtree, Kathleen E Anderson, Julie C Dunning Hotopp

doi:10.1186/s40168-016-0224-8

Abstract

Background: A variety of bacteria are known to influence carcinogenesis. Therefore, we sought to investigate whether publicly available whole genome and whole transcriptome sequencing data generated by large public cancer genome efforts, like The Cancer Genome Atlas (TCGA), could be used to identify bacteria associated with cancer. The Burrows-Wheeler aligner (BWA) was used to align a subset of Illumina paired-end sequencing data from TCGA to the human reference genome and all complete bacterial genomes in the RefSeq database in an effort to identify bacterial read pairs from the microbiome.

Results: Through careful consideration of all of the bacterial taxa present in the cancer types investigated, their relative abundance, and batch effects, we were able to identify some read pairs from certain taxa as likely resulting from contamination. In particular, the presence of Mycobacterium tuberculosis complex in the ovarian serous cystadenocarcinoma (OV) and glioblastoma multiforme (GBM) samples was correlated with the sequencing center of the samples. Additionally, there was a correlation between the presence of Ralstonia spp. and two specific plates of acute myeloid leukemia (AML) samples. In the end, associations remained between Pseudomonas-like and Acinetobacter-like read pairs in AML, and Pseudomonas-like read pairs in stomach adenocarcinoma (STAD), that could not be explained through batch effects or systematic contamination as seen in other samples.

Conclusions: This approach suggests that it is possible to identify bacteria that may be present in human tumor samples from public genome sequencing data that can be examined further experimentally. More weight should be given to this approach in the future when bacterial associations with diseases are suspected.
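The batch-effect reasoning in the Results can be illustrated with a small contingency-table check. The sketch below is not the authors' method: it assumes a two-sided Fisher's exact test as one reasonable way to ask whether a taxon's presence tracks the sequencing center, and the sample records, center names, and function names (`center_table`, `fisher_exact_two_sided`) are hypothetical.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum over all tables that are no more likely than the observed one.
    return sum(
        p for p in (prob(x) for x in range(lo, hi + 1))
        if p <= p_obs * (1 + 1e-9)
    )


def center_table(samples, center):
    """Count taxon presence/absence inside and outside one sequencing center."""
    a = sum(1 for _, c, present in samples if c == center and present)
    b = sum(1 for _, c, present in samples if c == center and not present)
    c_ = sum(1 for _, c, present in samples if c != center and present)
    d = sum(1 for _, c, present in samples if c != center and not present)
    return a, b, c_, d


# Hypothetical records: (sample id, sequencing center, taxon detected?)
samples = [
    ("s1", "center_A", True), ("s2", "center_A", True),
    ("s3", "center_A", True), ("s4", "center_A", True),
    ("s5", "center_B", False), ("s6", "center_B", False),
    ("s7", "center_B", False), ("s8", "center_B", False),
]

p_value = fisher_exact_two_sided(*center_table(samples, "center_A"))
print(f"p = {p_value:.4f}")
```

A small p-value here only says that presence is associated with the center in this toy data. Treating such a taxon as a likely contaminant, as the abstract does for Mycobacterium tuberculosis complex, is a separate interpretive step.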

Highlights

A variety of bacteria are known to influence carcinogenesis.
While most cancer-related bacteria are the dominant member of the microbiome, it is possible that rare members could cause driver mutations and/or that dominant members might be more abundant in tumors due to a favorable tumor microenvironment.
Bacterial presence in The Cancer Genome Atlas (TCGA) data: In order to determine the microbial component of various cancers, we analyzed the relative abundance of bacteria-derived paired-end Illumina sequencing reads in TCGA data that were made available in the Sequence Read Archive (SRA), as previously described [30].

Summary

Introduction

We sought to investigate whether publicly available whole genome and whole transcriptome sequencing data generated by large public cancer genome efforts, like The Cancer Genome Atlas (TCGA), could be used to identify bacteria associated with cancer. The Burrows-Wheeler aligner (BWA) was used to align a subset of Illumina paired-end sequencing data from TCGA to the human reference genome and all complete bacterial genomes in the RefSeq database in an effort to identify bacterial read pairs from the microbiome. A subset of cancer-associated viruses are known to integrate into the human genome [2, 3], while viruses, parasites, and bacteria can all promote cancer through other mechanisms [4]. Of the bacteria known to be associated with carcinogenesis, the mechanisms linking H. pylori to gastric carcinoma and gastric mucosa-associated lymphoid tissue (MALT) lymphoma are best understood [5]. While most cancer-related bacteria are the dominant member of the microbiome, it is possible that rare members could cause driver mutations and/or that dominant members might be more abundant in tumors due to a favorable tumor microenvironment.
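As a rough illustration of what identifying bacterial read pairs and their relative abundance could look like after alignment, the sketch below works on already-parsed alignment results. It is a minimal sketch under stated assumptions: the rule that both mates must map to the same bacterial taxon, the record layout, and the names `bacterial_pair_counts` and `relative_abundance` are all hypothetical and are not taken from the paper.

```python
from collections import Counter, defaultdict

HUMAN = "human"  # hypothetical label for the human reference


def bacterial_pair_counts(pairs):
    """Count read pairs per sample and taxon.

    Assumed rule (not from the paper): a pair counts as bacterial only when
    both mates align, neither aligns to human, and both hit the same taxon.
    """
    counts = defaultdict(Counter)
    for sample, ref1, ref2 in pairs:
        if ref1 is None or ref2 is None:
            continue  # at least one mate is unaligned
        if HUMAN in (ref1, ref2):
            continue  # human or human-bacterial pair
        if ref1 != ref2:
            continue  # mates disagree on taxon
        counts[sample][ref1] += 1
    return counts


def relative_abundance(counts):
    """Convert per-sample taxon counts into fractions of bacterial pairs."""
    result = {}
    for sample, taxa in counts.items():
        total = sum(taxa.values())
        result[sample] = {taxon: n / total for taxon, n in taxa.items()}
    return result


# Hypothetical parsed alignments: (sample, mate 1 reference, mate 2 reference)
pairs = [
    ("AML_01", "Pseudomonas", "Pseudomonas"),
    ("AML_01", "Pseudomonas", "Pseudomonas"),
    ("AML_01", "Pseudomonas", "Pseudomonas"),
    ("AML_01", "Acinetobacter", "Acinetobacter"),
    ("AML_01", HUMAN, HUMAN),
    ("AML_01", HUMAN, "Pseudomonas"),
    ("AML_01", None, "Pseudomonas"),
]

abundance = relative_abundance(bacterial_pair_counts(pairs))
print(abundance["AML_01"])
```

Requiring agreement between mates is a deliberately conservative choice for this sketch; a real pipeline would also need to decide how to handle pairs that map equally well to several closely related genomes.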



Journal: Microbiome
Publication Date: Jan 25, 2017
Citations: 67
License type: open-access
