Abstract

It is well known that some enzymes, proteins, amino-acids between other biological molecules have more than one way to be coded in the DNA. That means, there are some biological molecules that can be identified by a set of sequences. For instance, the enzyme SmaI can be recognized by the words [Formula: see text] and [Formula: see text]. In this paper, we count the number of times that a biological sequence occurs through the DNA by any of its configurations, i.e. we provide the strong law of large numbers for a word sequence. To achieve our goal, we consider that DNA is modeled by an ergodic semi-Markov chain. We also present the Central Limit Theorem. Additionally, we compute the first hitting position of a set of words.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call