Correlations between alignment gaps and nucleotide substitution or amino acid replacement

Tae-Kun Seo,Benjamin D Redelings,Jeffrey L Thorne

doi:10.1073/pnas.2204435119

Abstract

To assess the conventional treatment in evolutionary inference of alignment gaps as missing data, we propose a simple nonparametric test of the null hypothesis that the locations of alignment gaps are independent of the nucleotide substitution or amino acid replacement process. When we apply the test to 1,390 protein alignments that are informed by protein tertiary structure and use a 5% significance level, the null hypothesis of independence between amino acid replacement and gap location is rejected for ∼65% of datasets. Via simulations that include substitution and insertion-deletion, we show that the test performs well with true alignments. When we simulate according to the null hypothesis and then apply the test to optimal alignments that are inferred by each of four widely used software packages, the null hypothesis is rejected too frequently. Via further simulations and analyses, we show that the overly frequent rejections of the null hypothesis are not solely due to weaknesses of widely used software for finding optimal alignments. Instead, our evidence suggests that optimal alignments are unrepresentative of true alignments and that biased evolutionary inferences may result from relying upon individual optimal alignments.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Correlations between alignment gaps and nucleotide substitution or amino acid replacement

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences of the United States of America

Lead the way for us

Journal: Proceedings of the National Academy of Sciences of the United States of America	Publication Date: Aug 16, 2022
License type: cc-by-nc-nd

Similar Papers

Towards optimal alignment of protein structure distance matrices
Inken Wohlers ... Gunnar W Klau
Computer applications in the biosciences : CABIOS | VOL. 26
Inken Wohlers, et. al.Inken Wohlers ... Gunnar W Klau
17 Jul 2010
Computer applications in the biosciences : CABIOS | VOL. 26

Testing for Spatial Clustering of Amino Acid Replacements Within Protein Tertiary Structure
Jiaye Yu ... Jeffrey L Thorne
Journal Of Molecular Evolution | VOL. 62
Jiaye Yu, et. al.Jiaye Yu ... Jeffrey L Thorne
25 Apr 2006
Journal Of Molecular Evolution | VOL. 62

Analysis of Categorical Data with the R Package confreq
Jörg-Henrik Heine ... Mark Stemmler
Psych | VOL. 3
Jörg-Henrik Heine, et. al.Jörg-Henrik Heine ... Mark Stemmler
07 Sep 2021
Psych | VOL. 3

Does fire occurrence modify the probability of being burned again? A null hypothesis test from Mediterranean ecosystems in NE Spain
R Salvador ... J Piñol
Ecological Modelling | VOL. 188
R Salvador, et. al.R Salvador ... J Piñol
12 Mar 2005
Ecological Modelling | VOL. 188

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Correlations between alignment gaps and nucleotide substitution or amino acid replacement

Abstract

Talk to us

Similar Papers

More From: Proceedings of the National Academy of Sciences of the United States of America