Abstract
Defining a microbial community and identifying bacteria, at least at the genus level, is a first step in predicting the behavior of a microbial community in bioremediation. In biological treatment systems, the most dominating groups observed are Pseudomonas, Moraxella, Acinetobactor, Burkholderia, and Alcaligenes. Our interest lies in identifying the distinguishing features of these bacterial groups based on their 16S rDNA sequence data, which could be used further for generating genus-specific probes. Accordingly, 20 sequences representing different species from each genus above were retrieved, which constituted a training set. A 16-dimensional feature vector comprised of transition probabilities of nucleotides was considered and each sampled sequence was expressed in terms of these features. A stepwise feature selection method was used to identify features that are distinct across the species of these five groups. Wilk's lambda selection criterion was used and resulted in a subset with six distinguishing features. The discriminating efficacy of this subset was tested through multiple group discriminant analysis. Two linear composites, as a function of these features, could discriminate the test set of forty-five sequences from these groups with 95% accuracy, thereby ascertaining the relevance of the identified features. The geometric representation of feature correlation in the reduced discriminant space demonstrated the dominance of identified features in specific groups. These features independently or in combination could be used to generate genus-specific patterns to design probes, so as to develop a tracking tool for the selected group of bacteria.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Journal of computational biology : a journal of computational molecular cell biology
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.