Effective ambiguity checking in biosequence analysis

Janina Reeder,Peter Steffen,Robert Giegerich

doi:10.1186/1471-2105-6-153

Abstract

BackgroundAmbiguity is a problem in biosequence analysis that arises in various analysis tasks solved via dynamic programming, and in particular, in the modeling of families of RNA secondary structures with stochastic context free grammars. Several types of analysis are invalidated by the presence of ambiguity. As this problem inherits undecidability (as we show here) from the namely problem for context free languages, there is no complete algorithmic solution to the problem of ambiguity checking.ResultsWe explain frequently observed sources of ambiguity, and show how to avoid them. We suggest four testing procedures that may help to detect ambiguity when present, including a just-in-time test that permits to work safely with a potentially ambiguous grammar. We introduce, for the special case of stochastic context free grammars and RNA structure modeling, an automated partial procedure for proving non-ambiguity. It is used to demonstrate non-ambiguity for several relevant grammars.ConclusionOur mechanical proof procedure and our testing methods provide a powerful arsenal of methods to ensure non-ambiguity.

Highlights

Ambiguity is a problem in biosequence analysis that arises in various analysis tasks solved via dynamic programming, and in particular, in the modeling of families of RNA secondary structures with stochastic context free grammars
Our mechanical proof procedure and our testing methods provide a powerful arsenal of methods to ensure non-ambiguity
We have presented testing methods and a partial proof procedure to analyze the semantic ambiguity of SCFGs

Summary

Introduction

Ambiguity is a problem in biosequence analysis that arises in various analysis tasks solved via dynamic programming, and in particular, in the modeling of families of RNA secondary structures with stochastic context free grammars. Several types of analysis are invalidated by the presence of ambiguity As this problem inherits undecidability (as we show here) from the namely problem for context free languages, there is no complete algorithmic solution to the problem of ambiguity checking. The ambiguity problem in biosequence analysis Biosequence analysis problems are typically optimization problems – we seek the best alignment of two protein sequences under a similarity score, or the most stable secondary structure of an RNA molecule under a thermodynamic model. In such a problem, there is a "good" and a "bad" type of ambiguity. In striving for avoidance of ambiguity, we want to get rid of the bad type and retain the good

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: BMC Bioinformatics	Publication Date: Jan 1, 2005
Citations: 35	License type: CC BY 2.0

R Discovery Prime

R Discovery Prime

Effective ambiguity checking in biosequence analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics

Lead the way for us

Similar Papers

GROWTH TRANSFORMATIONS FOR PROBABILISTIC FUNCTIONS OF STOCHASTIC GRAMMARS
Francisco Casacuberta
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 10
Francisco CasacubertaFrancisco Casacuberta
01 May 1996
International Journal of Pattern Recognition and Artificial Intelligence | VOL. 10

Modeling and Interpretation of Multifunction Radars with Stochastic Grammar
A Wang ... V Krishnamurthy
-
A Wang, et. al.A Wang ... V Krishnamurthy
01 Mar 2008
01 Mar 2008

Recursive Markov chains, stochastic grammars, and monotone systems of nonlinear equations
Kousha Etessami ... Mihalis Yannakakis
Journal of the ACM | VOL. 56
Kousha Etessami, et. al.Kousha Etessami ... Mihalis Yannakakis
01 Jan 2009
Journal of the ACM | VOL. 56

Syntactic stochastic processes: Definitions, models, and related inference problems
Francesco Carravetta ... Langford B White
Information and Computation | VOL. 281
Francesco Carravetta, et. al.Francesco Carravetta ... Langford B White
24 Nov 2020
Information and Computation | VOL. 281

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Effective ambiguity checking in biosequence analysis

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: BMC Bioinformatics