Aligning biological sequences by exploiting residue conservation and coevolution.

Anna Paola Muntoni,Andrea Pagnani,Martin Weigt,Francesco Zamponi

doi:10.1103/physreve.102.062409

Abstract

Sequences of nucleotides (for DNA and RNA) or amino acids (for proteins) are central objects in biology. Among the most important computational problems is that of sequence alignment, i.e., arranging sequences from different organisms in such a way to identify similar regions, to detect evolutionary relationships between sequences, and to predict biomolecular structure and function. This is typically addressed through profile models, which capture position specificities like conservation in sequences but assume an independent evolution of different positions. Over recent years, it has been well established that coevolution of different amino-acid positions is essential for maintaining three-dimensional structure and function. Modeling approaches based on inverse statistical physics can catch the coevolution signal in sequence ensembles, and they are now widely used in predicting protein structure, protein-protein interactions, and mutational landscapes. Here, we present DCAlign, an efficient alignment algorithm based on an approximate message-passing strategy, which is able to overcome the limitations of profile models, to include coevolution among positions in a general way, and to be therefore universally applicable to protein- and RNA-sequence alignment without the need of using complementary structural information. The potential of DCAlign is carefully explored using well-controlled simulated data, as well as real protein and RNA sequences.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Aligning biological sequences by exploiting residue conservation and coevolution.

Abstract

Talk to us

Similar Papers

More From: Physical review. E

Lead the way for us

Journal: Physical review. E	Publication Date: Dec 7, 2020
Citations: 70

Similar Papers

Chapter 3 - Machine Learning for Protein Structure and Function Prediction
Robert Ezra Langlois ... Hui Lu
Annual Reports in Computational Chemistry | VOL. 4
Robert Ezra Langlois, et. al.Robert Ezra Langlois ... Hui Lu
01 Jan 2008
Annual Reports in Computational Chemistry | VOL. 4

Selection of appropriate metaheuristic algorithms for protein structure prediction in AB off-lattice model: a perspective from fitness landscape analysis
Nanda Dulal Jana ... Swagatam Das
Information Sciences | VOL. 391-392
Nanda Dulal Jana, et. al.Nanda Dulal Jana ... Swagatam Das
25 Jan 2017
Information Sciences | VOL. 391-392

Diversity of Sequences Folding to Highly and Poorly Designable Structures
Sumudu P Leelananda ... Robert L Jernigan
Biophysical Journal | VOL. 102
Sumudu P Leelananda, et. al.Sumudu P Leelananda ... Robert L Jernigan
01 Jan 2012
Biophysical Journal | VOL. 102

The Application of Artificial Bee Colony Algorithm in Protein Structure Prediction
Yanzhang Li ... Xuedong Zheng
-
Yanzhang Li, et. al.Yanzhang Li ... Xuedong Zheng
01 Jan 2014
01 Jan 2014

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Aligning biological sequences by exploiting residue conservation and coevolution.

Abstract

Talk to us

Similar Papers

More From: Physical review. E