Abstract
Background
The Baum-Welch learning procedure for Hidden Markov Models (HMMs) provides a powerful tool for tailoring HMM topologies to data for use in knowledge discovery and clustering. A linear memory procedure recently proposed by Miklós, I. and Meyer, I.M. describes a memory-sparse version of the Baum-Welch algorithm with modifications to the original probabilistic table topologies that make memory use independent of sequence length (and linearly dependent on state number). The original description of the technique contains some errors that we amend. We then compare the corrected implementation on a variety of data sets with conventional and checkpointing implementations.
Results
We provide a correct recurrence relation for the emission parameter estimate and extend it to parameter estimates of the Normal distribution. To accelerate estimation of the prior state probabilities, and to decrease memory use, we reverse the originally proposed forward sweep. We describe the different scaling strategies necessary in all real implementations of the algorithm to prevent underflow. We also describe our approach to a linear memory implementation of the Viterbi decoding algorithm (with linearity in the sequence length, while memory use is approximately independent of state number). We demonstrate the use of the linear memory implementation on an extended Duration Hidden Markov Model (DHMM) and on an HMM with a spike detection topology. Comparing the various implementations of the Baum-Welch procedure, we find that the checkpointing algorithm produces the best overall tradeoff between memory use and speed. In cases where sequence length is very large (for Baum-Welch) or state number is very large (for Viterbi), the linear memory methods outlined here may offer some utility.
Conclusion
Our performance-optimized Java implementations of the Baum-Welch algorithm are available at . The described method and implementations will aid sequence alignment, gene structure prediction, HMM profile training, nanopore ionic-flow blockade analysis, and many other domains that require efficient HMM training with EM.
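To illustrate the underflow problem the abstract raises, the sketch below shows the standard Rabiner-style scaled forward recursion in Java (the paper's implementation language): each forward column is normalized to sum to one and the log scaling factors are accumulated to recover the log-likelihood. It also keeps only two columns of the forward table, so the likelihood computation itself uses memory independent of sequence length. This is a generic sketch under our own naming, not the paper's corrected recurrences.

```java
/**
 * Illustrative sketch of a scaled forward recursion (Rabiner-style scaling).
 * Class and method names are ours. Conventions:
 *   pi[i]   - prior probability of starting in state i,
 *   a[i][j] - P(next state j | current state i),
 *   b[j][o] - P(emitting symbol o | state j).
 */
public final class ScaledForward {

    /** Returns log P(obs | model), accumulated from the scaling factors. */
    public static double logLikelihood(double[] pi, double[][] a,
                                       double[][] b, int[] obs) {
        int n = pi.length;                // number of states
        double[] alpha = new double[n];   // current forward column
        double[] next = new double[n];    // next forward column
        double logLik = 0.0;

        // Initialization: alpha_1(i) = pi_i * b_i(o_1), then rescale.
        for (int i = 0; i < n; i++) alpha[i] = pi[i] * b[i][obs[0]];
        logLik += rescale(alpha);

        // Induction: alpha_{t+1}(j) = [sum_i alpha_t(i) * a_ij] * b_j(o_{t+1}).
        for (int t = 1; t < obs.length; t++) {
            for (int j = 0; j < n; j++) {
                double sum = 0.0;
                for (int i = 0; i < n; i++) sum += alpha[i] * a[i][j];
                next[j] = sum * b[j][obs[t]];
            }
            double[] tmp = alpha; alpha = next; next = tmp; // reuse the two buffers
            logLik += rescale(alpha);
        }
        return logLik;
    }

    /** Normalizes v to sum 1 in place; returns the log of the scaling factor. */
    private static double rescale(double[] v) {
        double c = 0.0;
        for (double x : v) c += x;
        for (int i = 0; i < v.length; i++) v[i] /= c;
        return Math.log(c);
    }
}
```

The same per-column normalization carries over to the backward pass, which is what makes any real Baum-Welch implementation usable on long sequences.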
Highlights
The Baum-Welch learning procedure for Hidden Markov Models (HMMs) provides a powerful tool for tailoring HMM topologies to data for use in knowledge discovery and clustering
Hidden Markov Models (HMMs) are a widely accepted modeling tool [1] used in various domains, such as speech recognition [2] and bioinformatics [3]
The HMM can be represented as a directed graph with N states, where each state emits either a discrete character or a continuous value drawn from a Probability Density Function (PDF); a minimal sketch of such a structure follows below
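As an illustration of that representation, here is a minimal Java sketch of an HMM whose states carry either a discrete emission table or Normal (Gaussian) PDF parameters, matching the Normal-distribution estimates mentioned in the Results. All names (`Hmm`, `emissionDensity`, the field names) are our own illustrative choices, not identifiers from the paper's implementation.

```java
/**
 * Minimal sketch of an HMM as a directed graph of N states.
 * Each state emits either a discrete symbol (via a row of `emit`)
 * or a continuous value drawn from a Normal PDF (via `mean`/`stdDev`).
 * Names are illustrative only.
 */
public final class Hmm {
    final double[] pi;        // prior (initial) state probabilities, length N
    final double[][] trans;   // trans[i][j] = P(next state j | current state i)

    // Discrete states: emit[i][s] = P(symbol s | state i); emit[i] == null
    // marks state i as continuous.
    final double[][] emit;
    // Continuous states: Normal emission parameters (unused for discrete states).
    final double[] mean;
    final double[] stdDev;

    Hmm(double[] pi, double[][] trans, double[][] emit,
        double[] mean, double[] stdDev) {
        this.pi = pi; this.trans = trans; this.emit = emit;
        this.mean = mean; this.stdDev = stdDev;
    }

    /** Emission probability/density of observation x in state i. */
    double emissionDensity(int i, double x) {
        if (emit != null && emit[i] != null) {
            return emit[i][(int) x];  // discrete state: x is a symbol index
        }
        // Continuous state: Normal PDF with the state's mean and std. deviation.
        double z = (x - mean[i]) / stdDev[i];
        return Math.exp(-0.5 * z * z) / (stdDev[i] * Math.sqrt(2.0 * Math.PI));
    }
}
```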
Published: 30 April 2008 · BMC Bioinformatics 2008, 9:224 · doi:10.1186/1471-2105-9-224