Abstract

Microbial communities in biological stains can provide valuable information to help forensic scientists identify the body fluid or tissue present in a stain. Because these microbial communities are characteristic of body habitats, DNA sequencing of microbes can be used to predict bodily origin. Promising predictive results have been obtained with supervised machine learning algorithms trained on bacterial abundance data from human body sites. Importantly, prediction accuracy depends on the training dataset, yet compiling a large and comprehensive training reference is a non-trivial task requiring substantial effort. Here we present a new online database and associated data-mining platform which is, to our knowledge, the first customised for forensic scientists investigating body fluids and tissues. Our database features samples originating from ten human body sites, which users can select through an online platform. Users can download bacterial abundance and taxonomic data, which can then be used to train predictive models and test their accuracy. Future stages of platform development will include curation of the samples to reduce potential errors in sample labelling, as well as access to an online tool for exploratory analyses.
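
As a rough illustration of the train-and-test workflow described above, the following minimal sketch fits a nearest-centroid classifier to relative bacterial abundances and scores it on held-out samples. It is not the method used by the platform or by the studies mentioned: the classifier choice, the function names, the body-site labels (`site_a`, `site_b`) and the toy counts are all assumptions made for illustration, and real abundance tables downloaded from the database would replace the invented data.

```python
from collections import defaultdict
import math


def to_relative(counts):
    """Convert raw taxon counts to relative abundances that sum to 1."""
    total = sum(counts)
    if total == 0:
        return [0.0] * len(counts)
    return [c / total for c in counts]


def fit_centroids(samples, labels):
    """Average the relative-abundance profiles of each body site."""
    grouped = defaultdict(list)
    for counts, label in zip(samples, labels):
        grouped[label].append(to_relative(counts))
    return {
        label: [sum(col) / len(rows) for col in zip(*rows)]
        for label, rows in grouped.items()
    }


def predict(centroids, counts):
    """Return the body site whose centroid is closest in Euclidean distance."""
    profile = to_relative(counts)
    return min(centroids, key=lambda label: math.dist(profile, centroids[label]))


def accuracy(centroids, samples, labels):
    """Fraction of samples whose predicted site matches the recorded label."""
    hits = sum(predict(centroids, s) == y for s, y in zip(samples, labels))
    return hits / len(labels)


# Toy data, invented for illustration only.
# Each row is one sample; each column is a hypothetical bacterial taxon.
train_x = [[90, 5, 5], [80, 15, 5], [10, 85, 5], [5, 90, 5]]
train_y = ["site_a", "site_a", "site_b", "site_b"]
test_x = [[70, 20, 10], [20, 70, 10]]
test_y = ["site_a", "site_b"]

model = fit_centroids(train_x, train_y)
held_out_accuracy = accuracy(model, test_x, test_y)
```

Scoring on samples that were not used for fitting reflects the point that prediction accuracy depends on the training dataset: a larger or better-curated reference changes the centroids and therefore the held-out score.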
