Higher-order Markov models for metagenomic sequence classification.

David J Burks, Rajeev K Azad

doi:10.1093/bioinformatics/btaa562

Abstract

Alignment-free, stochastic models derived from k-mer distributions representing reference genome sequences have a rich history in the classification of DNA sequences. In particular, variants of Markov models have been used extensively. Higher-order Markov models have been used with caution, and perhaps sparingly, primarily because of insufficient training data and computational power. Advances in sequencing technology and computation have since made it possible to exploit the predictive power of higher-order models. We therefore revisited higher-order Markov models and assessed their performance in classifying metagenomic sequences. Higher-order models (HOMs, 9th order or higher) were compared with the interpolated Markov model, the interpolated context model and lower-order models (8th order or lower) on metagenomic datasets constructed from sequenced prokaryotic genomes. Our results show that HOMs outperform the other models in classifying metagenomic fragments as short as 100 nt at all taxonomic ranks, and at lower ranks when the fragment size was increased to 250 nt. HOMs were also found to be significantly more accurate than local alignment, which is widely relied upon for taxonomic classification of metagenomic sequences. A novel software implementation written in C++ performs classification faster than the existing Markovian metagenomic classifiers and can therefore be used as a standalone classifier or in conjunction with existing taxonomic classifiers for more robust classification of metagenomic sequences. The software is available at https://github.com/djburks/SMM. Contact: Rajeev.Azad@unt.edu. Supplementary data are available at Bioinformatics online.
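To make the idea of classifying a fragment with a fixed-order Markov model concrete, the following is a minimal sketch in Python. It is not the SMM implementation described in the abstract. The function names (`train_markov`, `log_likelihood`, `classify`), the add-one pseudocount smoothing, and the toy genomes are all assumptions chosen for illustration. The order is set to 3 only to keep the example small, whereas the abstract defines HOMs as 9th order or higher.

```python
import math
from collections import defaultdict

ALPHABET = "ACGT"

def train_markov(sequence, order, pseudocount=1.0):
    """Count (context -> next base) transitions for a fixed-order model.

    Hypothetical helper: windows containing non-ACGT symbols are skipped.
    """
    counts = defaultdict(lambda: defaultdict(float))
    for i in range(order, len(sequence)):
        context = sequence[i - order:i]
        base = sequence[i]
        if base in ALPHABET and all(c in ALPHABET for c in context):
            counts[context][base] += 1.0
    # Freeze into plain dicts so lookups never create new entries.
    frozen = {ctx: dict(nxt) for ctx, nxt in counts.items()}
    return {"order": order, "counts": frozen, "pseudocount": pseudocount}

def log_likelihood(model, read):
    """Sum of log P(base | preceding `order` bases) over the read.

    Assumption: add-pseudocount smoothing, so an unseen context
    falls back to a uniform distribution over the four bases.
    """
    order = model["order"]
    counts = model["counts"]
    pc = model["pseudocount"]
    total = 0.0
    for i in range(order, len(read)):
        base = read[i]
        if base not in ALPHABET:
            continue  # skip ambiguous bases such as N
        ctx_counts = counts.get(read[i - order:i], {})
        numerator = ctx_counts.get(base, 0.0) + pc
        denominator = sum(ctx_counts.values()) + pc * len(ALPHABET)
        total += math.log(numerator / denominator)
    return total

def classify(models, read):
    """Return the label of the model that assigns the read the highest score."""
    return max(models, key=lambda label: log_likelihood(models[label], read))

# Toy reference "genomes" (made up for illustration only).
models = {
    "taxon_A": train_markov("ACGT" * 50, order=3),
    "taxon_B": train_markov("AATT" * 50, order=3),
}
print(classify(models, "ACGTACGTACGT"))
print(classify(models, "AATTAATTAATT"))
```

In this sketch each reference genome gets its own model, and a fragment is assigned to the reference whose model gives it the highest log-likelihood. Raising `order` increases the number of possible contexts fourfold per step, which is the training-data and memory cost the abstract alludes to.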

Journal: Bioinformatics
Publication Date: Jun 9, 2020
Citations: 7

Similar Papers

Generating a New Model for Predicting the Next Accessed Web Page in Web Usage Mining
B Nigam ... S Jain
01 Nov 2010

Taxonomic classification of metagenomic shotgun sequences with CARMA3
Wolfgang Gerlach ... Jens Stoye
Nucleic Acids Research | VOL. 39
17 May 2011

Selective Markov Models for Predicting Web-Page Accesses
Mukund Deshpande ... George Karypis
05 Apr 2001

PKM3: an optimal Markov model for predicting future navigation sequences of the web surfers
Honey Jindal ... Neetu Sardana
Pattern Analysis and Applications | VOL. 24
24 Jun 2020
