Toward a high-quality pan-genome landscape of Bacillus subtilis by removal of confounding strains.

Hao Wu,Dan Wang,Feng Gao

doi:10.1093/bib/bbaa013

Abstract

Pan-genome analysis is widely used to study the evolution and genetic diversity of species, particularly in bacteria. However, the impact of strain selection on the outcome of pan-genome analysis is poorly understood. Furthermore, a standard protocol to ensure high-quality pan-genome results is lacking. In this study, we carried out a series of pan-genome analyses of different strain sets of Bacillus subtilis to understand the impact of various strains on the performance and output quality of pan-genome analyses. Consequently, we found that the results obtained by pan-genome analyses of B. subtilis can be influenced by the inclusion of incorrectly classified Bacillus subspecies strains, phylogenetically distinct strains, engineered genome-reduced strains, chimeric strains, strains with a large number of unique genes or a large proportion of pseudogenes, and multiple clonal strains. Since the presence of these confounding strains can seriously affect the quality and true landscape of the pan-genome, we should remove these deviations in the process of pan-genome analyses. Our study provides new insights into the removal of biases from confounding strains in pan-genome analyses at the beginning of data processing, which enables the achievement of a closer representation of a high-quality pan-genome landscape of B. subtilis that better reflects the performance and credibility of the B. subtilis pan-genome. This procedure could be added as an important quality control step in pan-genome analyses for improving the efficiency of analyses, and ultimately contributing to a better understanding of genome function, evolution and genome-reduction strategies for B. subtilis in the future.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Toward a high-quality pan-genome landscape of Bacillus subtilis by removal of confounding strains.

Abstract

Talk to us

Similar Papers

More From: Briefings in Bioinformatics

Lead the way for us

Journal: Briefings in Bioinformatics	Publication Date: Feb 17, 2020
Citations: 39

Similar Papers

Pan-genomics of Ochrobactrum species from clinical and environmental origins reveals distinct populations and possible links
Kushal Gohil ... Mahesh Dharne
Genomics | VOL. 112
Kushal Gohil, et. al.Kushal Gohil ... Mahesh Dharne
17 May 2020
Genomics | VOL. 112

First Steps in the Analysis of Prokaryotic Pan-Genomes.
Sávio Souza Costa ... Artur Silva
Bioinformatics and Biology Insights | VOL. 14
Sávio Souza Costa, et. al.Sávio Souza Costa ... Artur Silva
01 Jan 2020
Bioinformatics and Biology Insights | VOL. 14

Pan-Genomic Analysis Provides Insights into the Genomic Variation and Evolution of Salmonella Paratyphi A
Weili Liang ... Chunxia Chen
PLoS ONE | VOL. 7
Weili Liang, et. al.Weili Liang ... Chunxia Chen
19 Sep 2012
PLoS ONE | VOL. 7

Community genetic interactions mediate indirect ecological effects between a parasitoid wasp and rhizobacteria
Sharon E Zytynska ... Richard F Preziosi
Ecology | VOL. 91
Sharon E Zytynska, et. al.Sharon E Zytynska ... Richard F Preziosi
01 Jun 2010
Ecology | VOL. 91

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Toward a high-quality pan-genome landscape of Bacillus subtilis by removal of confounding strains.

Abstract

Talk to us

Similar Papers

More From: Briefings in Bioinformatics