Parameter advising for multiple sequence alignment

Dan DeBlasio, John Kececioglu

doi:10.1186/1471-2105-16-s2-a3

Abstract

Background: While the multiple sequence alignment output by an aligner depends strongly on the parameter values used for its alignment scoring function (i.e., the choice of gap penalties and substitution scores), most users rely on the single default parameter setting. A different parameter setting, however, might yield a much higher-quality alignment for a specific set of input sequences. The problem of picking a good choice of parameter values for a given set of input sequences is called parameter advising. A parameter advisor has two ingredients: (i) a set of parameter choices to select from, and (ii) an estimator that estimates the accuracy of a computed alignment; the parameter advisor then picks the parameter choice from the set whose resulting alignment has the highest estimated accuracy. Our estimator Facet (Feature-based Accuracy Estimator) is a linear combination of real-valued feature functions of an alignment. We assume that the feature functions are given, as well as the universe of parameter choices from which the advisor’s set is drawn. For this scenario we define the problem of learning an optimal advisor by finding the best possible parameter set for a collection of training data of reference alignments. Learning optimal advisor sets is NP-complete [1]. For the advisor sets
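The advising loop described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the Facet or Opal implementation: the two feature functions, the equal weights, and `toy_aligner` (which only pads sequences with gaps on one side) are hypothetical stand-ins chosen so the example runs on its own.

```python
from typing import Callable, List, Sequence, Tuple

Alignment = List[str]  # equal-length rows, '-' marks a gap


def column_identity(aln: Alignment) -> float:
    """Hypothetical feature: fraction of columns whose rows all share one residue."""
    cols = list(zip(*aln))
    same = sum(1 for c in cols if "-" not in c and len(set(c)) == 1)
    return same / len(cols)


def gap_free_fraction(aln: Alignment) -> float:
    """Hypothetical feature: fraction of columns containing no gap."""
    cols = list(zip(*aln))
    return sum(1 for c in cols if "-" not in c) / len(cols)


# Assumed features and weights; a real estimator would learn its weights.
FEATURES = [column_identity, gap_free_fraction]
WEIGHTS = [0.5, 0.5]


def estimate_accuracy(aln: Alignment) -> float:
    """Linear combination of real-valued feature functions of an alignment."""
    return sum(w * f(aln) for w, f in zip(WEIGHTS, FEATURES))


def toy_aligner(seqs: Sequence[str], gap_side: str) -> Alignment:
    """Stand-in aligner: its only 'parameter' is the side on which gaps are padded."""
    width = max(len(s) for s in seqs)
    if gap_side == "left":
        return [s.rjust(width, "-") for s in seqs]
    return [s.ljust(width, "-") for s in seqs]


def advise(
    seqs: Sequence[str],
    parameter_choices: Sequence[str],
    aligner: Callable[[Sequence[str], str], Alignment],
    estimator: Callable[[Alignment], float],
) -> Tuple[str, float]:
    """Align under every parameter choice; return the one with highest estimated accuracy."""
    scored = [(estimator(aligner(seqs, p)), p) for p in parameter_choices]
    best_score, best_choice = max(scored, key=lambda t: t[0])
    return best_choice, best_score


choice, score = advise(["ACGT", "CGT"], ["right", "left"], toy_aligner, estimate_accuracy)
```

Here the advisor prefers the `"left"` setting, because padding the gap on the left lines up the shared `CGT` suffix and raises both feature values. The advisor never sees a reference alignment; it relies entirely on the estimator.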

Highlights

While the multiple sequence alignment output by an aligner depends strongly on the parameter values used for its alignment scoring function (i.e., the choice of gap penalties and substitution scores), most users rely on the single default parameter setting.
We apply parameter advising to boost the true accuracy of the Opal aligner [4,5], where the advisor uses parameter sets found by the -approximation algorithm.
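The learning problem stated in the abstract (choosing the best parameter set for a collection of training data) can be made concrete with a brute-force sketch. This is not the approximation algorithm the authors refer to: it enumerates every subset of a given size, which is only feasible for tiny universes. The training table, the parameter names `p1` to `p3`, and the objective (mean true accuracy of the advisor's picks) are assumptions made for illustration.

```python
from itertools import combinations
from typing import Dict, Iterable, Tuple

# Hypothetical training data: benchmark -> {parameter choice: (estimated, true) accuracy}
TRAINING: Dict[str, Dict[str, Tuple[float, float]]] = {
    "bench1": {"p1": (0.9, 0.8), "p2": (0.5, 0.6), "p3": (0.7, 0.9)},
    "bench2": {"p1": (0.4, 0.3), "p2": (0.8, 0.7), "p3": (0.6, 0.5)},
    "bench3": {"p1": (0.6, 0.2), "p2": (0.3, 0.4), "p3": (0.5, 0.9)},
}


def advisor_accuracy(param_set: Iterable[str], training) -> float:
    """Mean true accuracy when the advisor picks, per benchmark,
    the choice in param_set with the highest estimated accuracy."""
    param_set = tuple(param_set)
    total = 0.0
    for table in training.values():
        pick = max(param_set, key=lambda p: table[p][0])  # chosen by estimate
        total += table[pick][1]                           # scored by truth
    return total / len(training)


def best_advisor_set(universe: Iterable[str], k: int, training) -> Tuple[str, ...]:
    """Exhaustive search over all size-k subsets (exponential in general)."""
    return max(
        combinations(sorted(universe), k),
        key=lambda s: advisor_accuracy(s, training),
    )


best_pair = best_advisor_set(["p1", "p2", "p3"], 2, TRAINING)
best_pair_accuracy = advisor_accuracy(best_pair, TRAINING)
```

In this toy table the best pair omits `p1`: its estimated accuracy is inflated on `bench3`, so including it would mislead the advisor there. The sketch is meant to show why the choice of set interacts with estimator error, not to suggest a practical search strategy.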



Journal: BMC Bioinformatics
Publication Date: Jan 28, 2015
Citations: 12
License type: cc-by


Similar Papers

Learning Parameter-Advising Sets for Multiple Sequence Alignment.
Dan DeBlasio ... John Kececioglu
IEEE/ACM transactions on computational biology and bioinformatics | VOL. 14
01 Sep 2017

Learning parameter sets for alignment advising
Dan DeBlasio ... John Kececioglu
20 Sep 2014

Running WILD: the case for exploring mixed parameter sets in sensitivity analysis.
Prashant P Sharma ... Gonzalo Giribet
Cladistics : the international journal of the Willi Hennig Society | VOL. 27
15 Dec 2010

Adaptive Local Realignment via Parameter Advising
Dan DeBlasio ... John Kececioglu
02 Oct 2016
