Abstract

BackgroundFlow cytometry, with its high throughput nature, combined with the ability to measure an increasing number of cell parameters at once can surpass the throughput of prevalent genomic and metagenomic approaches in the study of microbiomes. Novel computational approaches to analyze flow cytometry data will result in greater insights and actionability as compared to traditional tools used in the analysis of microbiomes. This paper is a demonstration of the fruitfulness of machine learning in analyzing microbial flow cytometry data generated in anaerobic microbiome perturbation experiments.ResultsAutoencoders were found to be powerful in detecting anomalies in flow cytometry data from nanoparticles and carbon sources perturbed anaerobic microbiomes but was marginal in predicting perturbations due to antibiotics. A comparison between different algorithms based on predictive capabilities suggested that gradient boosting (GB) and deep learning, i.e. feed forward artificial neural network with three hidden layers (DL) were marginally better under tested conditions at predicting overall community structure while distributed random forests (DRF) worked better for predicting the most important putative microbial group(s) in the anaerobic digesters viz. methanogens, and it can be optimized with better parameter tuning. Predictive classification patterns with DL (feed forward artificial neural network with three hidden layers) were found to be comparable to previously demonstrated multivariate analysis. The potential applications of this approach have been demonstrated for monitoring the syntrophic resilience of the anaerobic microbiomes perturbed by synthetic nanoparticles as well as antibiotics.ConclusionMachine learning can benefit the microbial flow cytometry research community by providing rapid screening and characterization tools to discover patterns in the dynamic response of microbiomes to several stimuli.

Highlights

  • Flow cytometry, with its high throughput nature, combined with the ability to measure an increasing number of cell parameters at once can surpass the throughput of prevalent genomic and metagenomic approaches in the study of microbiomes

  • TC2 might have started displaying maximum perturbation effects right from d 5 as Discussion The machine learning approaches to the microbial flow cytometry dataset are still in infancy but the results presented here from the analysis of 1,500,000 microbes corresponding to each perturbation incidence demonstrate that it has much greater potential compared to the tradition microbial ecology statistical analysis like Multidimensional Scaling (MDS) as demonstrated in our previous publication [17] or similar Principal Component Analysis (PCA)

  • Autoencoders were found to be powerful in detecting anomalies in flow cytometry data from nanoparticleand carbon source-perturbed anaerobic microbiomes but marginally so for antibiotic-perturbed communities

Read more

Summary

Introduction

With its high throughput nature, combined with the ability to measure an increasing number of cell parameters at once can surpass the throughput of prevalent genomic and metagenomic approaches in the study of microbiomes. Novel computational approaches to analyze flow cytometry data will result in greater insights and actionability as compared to traditional tools used in the analysis of microbiomes. The current processing pipeline for NGS requires ~ 14 h compared to ~ 2 h for flow cytometry. At this time, flow cytometry is significantly more high throughput for resolving rapid dynamic changes in the structure and function of the microbial communities over time, which is crucial for studying health and wellness in dynamic biosystems [5]. Since its genesis in 1965 [10] and increased popularity since the 1970s [11], the basic design of flow cytometers has remained almost unchanged, which emphasizes its technological robustness and is an ideal tool for building actionable solutions for microbiome research community

Objectives
Methods
Results
Discussion
Conclusion
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.