LARGE-SCALE MULTIPLE INFERENCE OF COLLECTIVE DEPENDENCE WITH APPLICATIONS TO PROTEIN FUNCTION.

Robert Jernigan,Zhao Ren,Wen Zhou,Kejue Jia

doi:10.1214/20-aoas1431

Abstract

Measuring the dependence of k ≥ 3 random variables and drawing inference from such higher-order dependences are scientifically important yet challenging. Motivated here by protein coevolution with multivariate categorical features, we consider an information theoretic measure of higher-order dependence. The proposed collective dependence is a symmetrization of differential interaction information which generalizes the mutual information of a pair of random variables. We show that the collective dependence can be easily estimated and facilitates a test on the dependence of k ≥ 3 random variables. Upon carefully exploring the null space of collective dependence, we devise a Classification-Assisted Large scaLe inference procedure to DEtect significant k-COllective DEpendence among d ≥ k random variables, with the false discovery rate controlled. Finite sample performance of our method is examined via simulations. We apply this method to the multiple protein sequence alignment data to study the residue or position coevolution for two protein families, the elongation factor P family and the zinc knuckle family. We identify novel functional triplets of amino acid residues, whose contributions to the protein function are further investigated. These confirm that the collective dependence does yield additional information important for understanding the protein coevolution compared to the pairwise measures.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

LARGE-SCALE MULTIPLE INFERENCE OF COLLECTIVE DEPENDENCE WITH APPLICATIONS TO PROTEIN FUNCTION.

Abstract

Talk to us

Similar Papers

More From: The annals of applied statistics

Lead the way for us

Journal: The annals of applied statistics	Publication Date: Jun 1, 2021
Citations: 1

Similar Papers

Protein co-evolution, co-adaptation and interactions
Florencio Pazos ... Alfonso Valencia
The EMBO Journal | VOL. 27
Florencio Pazos, et. al.Florencio Pazos ... Alfonso Valencia
25 Sep 2008
The EMBO Journal | VOL. 27

Genetic Association, Post-translational Modification, and Protein-Protein Interactions in Type 2 Diabetes Mellitus
Amitabh Sharma ... Dwaipayan Bharadwaj
Molecular & Cellular Proteomics | VOL. 4
Amitabh Sharma, et. al.Amitabh Sharma ... Dwaipayan Bharadwaj
01 Aug 2005
Molecular & Cellular Proteomics | VOL. 4

Sequence Coevolution between RNA and Protein Characterized by Mutual Information between Residue Triplets
Relly Brandman ... Vijay S Pande
PLoS ONE | VOL. 7
Relly Brandman, et. al.Relly Brandman ... Vijay S Pande
18 Jan 2012
PLoS ONE | VOL. 7

A Measure of Monotonicity of two Random Variables
K
Journal of Mathematics and Statistics | VOL. 8
K K
01 Feb 2012
Journal of Mathematics and Statistics | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

LARGE-SCALE MULTIPLE INFERENCE OF COLLECTIVE DEPENDENCE WITH APPLICATIONS TO PROTEIN FUNCTION.

Abstract

Talk to us

Similar Papers

More From: The annals of applied statistics