Factors That Influence the Choice of Markov Model Order in Discriminating DNA Sequences from Different Sources.

Ravi S Pandey,Rajeev K Azad

doi:10.1089/omi.2022.0043

Abstract

Markov models have frequently been used in genetic sequence analysis. The number of parameters of a Markov model increases exponentially with model order, so it is often recommended that the order be chosen based on the size of data being modeled, lower orders for small and higher orders for large dataset sizes. Approaches based on model selection criterion have also been proposed. An important problem in microbiology and evolutionary biology is to decipher chimeric genomes of microbes, particularly, identify segments of distinct ancestries in genomes and reconstruct the plausible evolutionary scenarios that might have shaped the chimeric genomes in the microbial world. In this study, we assessed a Markov model-based segmentation method for its ability to detect compositionally disparate segments in chimeric sequence constructs as a function of model order, sequence length, and phylogenetic divergence. Our results show that the choice of Markov model order depends on both sequence size and composition. Higher order Markov models were found to be more effective in delineating sequence segments arising from closely related organisms in longer constructs; on the other hand, lower order Markov models were found to be more appropriate in delineating sequence segments arising from distantly related organisms in shorter constructs. These findings are important and timely, with broad implications in fields such as epidemiology that has to deal with the emergence of novel pathogenic chimeras that arise by foreign DNA acquisition, and ecology where chimeric structures may arise in various ecosystems, necessitating more robust approaches for their deconstruction and interpretation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Factors That Influence the Choice of Markov Model Order in Discriminating DNA Sequences from Different Sources.

Abstract

Talk to us

Similar Papers

More From: Omics : a journal of integrative biology

Lead the way for us

Similar Papers

Improving Prediction of Tobacco Use Over Time: Findings from Waves 1-4 of the Population Assessment of Tobacco and Health Study.
Sarah D Mills ... Kristen Hassmiller Lich
Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco | VOL. 26
Sarah D Mills, et. al.Sarah D Mills ... Kristen Hassmiller Lich
06 Sep 2023
Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco | VOL. 26

Selective Markov Models for Predicting Web-Page Accesses
Mukund Deshpande ... George Karypis
-
Mukund Deshpande, et. al.Mukund Deshpande ... George Karypis
05 Apr 2001
05 Apr 2001

When History and Heterogeneity Matter: A Tutorial on the Impact of Markov Model Specifications in the Context of Colorectal Cancer Screening.
Rachel M Townsley ... Kristen Hasmiller Lich
Medical Decision Making | VOL. 42
Rachel M Townsley, et. al.Rachel M Townsley ... Kristen Hasmiller Lich
11 May 2022
Medical Decision Making | VOL. 42

A Markov Chain Model with High-Order Hidden Process and Mixture Transition Distribution
Sheng-Na Zhang ... Lei Wu
-
Sheng-Na Zhang, et. al.Sheng-Na Zhang ... Lei Wu
01 Dec 2013
01 Dec 2013

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Factors That Influence the Choice of Markov Model Order in Discriminating DNA Sequences from Different Sources.

Abstract

Talk to us

Similar Papers

More From: Omics : a journal of integrative biology