Bioinformatics: Strategies, Trends, and Perspectives

Carlos Norberto,Adriane Beatriz de Souza Serapio

doi:10.5772/9441

Abstract

With the advances in the genome area, new techniques and automation processes for DNA sequencing, the amount of data produced has increased exponentially. Analyzing this data, in order to identify interesting biological features, is an enormous challenge, especially if it would be done manually. Think about trying to find a specific word in a book, say Don Quixote, and we have to search word by word. How long it would take? Bioinformatics has played an important role trying to help specialists to analyze data of a specific genome. The application of information technology, associated with techniques from applied mathematics, informatics, statistics, and computer science, has allowed the discovering of interesting and important characteristics in genomes, allowing to understand and solve several biological problems, or even to generate more knowledge or insight about the problem and its involved biological processes, what can bring advances in the used techniques. In Computing area, for example, an ordinary type of task is to process texts. There are several problems involving strings, like trying to find a specific word (we could say “to align words”) or a similar one (considering a particular pattern of characters) in a text. When processing genomic data, if it is desired to search for a specific pattern (and its approximations) in DNA sequences, the natural way is to use solutions already implemented. Thus, for pattern (exact or not) search and similar problems, bioinformaticians have developed computational tools that apply techniques and algorithms well-known in Computing area in order to solve these important genomic problems. Sometimes, they need to adapt algorithms for considering specific features of the biological problem. Two good examples of this case are Sequence Aligning and Sequence Assembly, processes resulting of adaptations in algorithms in order to consider insertion, deletion, and substitution of nucleotides in DNA sequences. Some statistical and computational techniques, such as Hidden Markov Models (HMMs), Stochastic Grammars, and Conditional Random Fields (CRFs) have been successfully applied for modeling, analysis, discovery, classification, and alignment of biological sequences (Yoon & Vaidyanathan, 2004, 2005). HMMs (Rabiner, 1989) and Stochastic Grammars (Sakakibara et al., 1994) are forms of generative models to label sequences, assigning a joint probability distribution of, for example, the gene hidden structure y and the 7

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Bioinformatics: Strategies, Trends, and Perspectives

Abstract

Talk to us

Similar Papers

Lead the way for us

Publication Date: Mar 1, 2010
Citations: 28	License type: cc-by-nc-sa

Similar Papers

A Probabilistic Address Parser Using Conditional Random Fields and Stochastic Regular Grammar
Minlue Wang ... Valeriia Haberland
-
Minlue Wang, et. al.Minlue Wang ... Valeriia Haberland
01 Dec 2016
01 Dec 2016

On Equivalence between Linear-chain Conditional Random Fields and Hidden Markov Chains
Elie Azeraf ... Emmanuel Monfrini
-
Elie Azeraf, et. al.Elie Azeraf ... Emmanuel Monfrini
01 Jan 2021
01 Jan 2021

PRM198 - Extracting Dosage Per Day From Free-Text Medication Prescriptions
M Törnblom ... M Rosenlund
Value in Health | VOL. 19
M Törnblom, et. al.M Törnblom ... M Rosenlund
31 Oct 2016
Value in Health | VOL. 19

Hidden Markov Models and their Applications in Biological Sequence Analysis
Byung-Jun Yoon
Current Genomics | VOL. 10
Byung-Jun YoonByung-Jun Yoon
01 Sep 2009
Current Genomics | VOL. 10

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Bioinformatics: Strategies, Trends, and Perspectives

Abstract

Talk to us

Similar Papers