Discovery of Functional Motifs from the Interface Region of Oligomeric Proteins Using Frequent Subgraph Mining.

Tanay Kumar Saha,Mohammad Al Hasan,Ataur Katebi,Wajdi Dhifli

doi:10.1109/tcbb.2017.2756879

Abstract

Modeling the interface region of a protein complex paves the way for understanding its dynamics and functionalities. Existing works model the interface region of a complex by using different approaches, such as, the residue composition at the interface region, the geometry of the interface residues, or the structural alignment of interface regions. These approaches are useful for ranking a set of docked conformation or for building scoring function for protein-protein docking, but they do not provide a generic and scalable technique for the extraction of interface patterns leading to functional motif discovery. In this work, we model the interface region of a protein complex by graphs and extract interface patterns of the given complex in the form of frequent subgraphs. To achieve this, we develop a scalable algorithm for frequent subgraph mining. We show that a systematic review of the mined subgraphs provides an effective method for the discovery of functional motifs that exist along the interface region of a given protein complex. In our experiments, we use three PDB protein structure datasets. The first two datasets are composed of PDB structures from different conformations of two dimeric protein complexes: HIV-1 protease (329 structures), and triosephosphate isomerase (TIM) (86 structures). The third dataset is a collection of different enzyme structures protein structures from the six top-level enzyme classes, namely: Oxydoreductase, Transferase, Hydrolase, Lyase, Isomerase, and Ligase. We show that for the first two datasets, our method captures the locking mechanism at the dimeric interface by taking into account the spatial positioning of the interfacial residues through graphs. Indeed, our frequent subgraph mining based approach discovers the patterns representing the dimerization lock which is formed at the base of the structure in 323 of the 329 HIV-1 protease structures. Similarly, for 86 TIM structures, our approach discovers the dimerization lock formation in 50 structures. For the enzyme structures, we show that we are able to capture the functional motifs (active sites) that are specific to each of the six top-level classes of enzymes through frequent subgraphs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: IEEE/ACM Transactions on Computational Biology and Bioinformatics	Publication Date: Sep 26, 2017
Citations: 58	License type: other-oa

R Discovery Prime

R Discovery Prime

Discovery of Functional Motifs from the Interface Region of Oligomeric Proteins Using Frequent Subgraph Mining.

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics

Lead the way for us

Similar Papers

MRFSG-IG: MapReduce based Frequent Subgraph Mining using Integrated Graph Index
Shaik Fathimabi ... Ebenezer Jangam
-
Shaik Fathimabi, et. al.Shaik Fathimabi ... Ebenezer Jangam
23 Mar 2022
23 Mar 2022

RASMA: a reverse search algorithm for mining maximal frequent subgraphs
Saeed Salem ... Mohammad Al Hasan
BioData Mining | VOL. 14
Saeed Salem, et. al.Saeed Salem ... Mohammad Al Hasan
16 Mar 2021
BioData Mining | VOL. 14

Mining frequent subgraphs over uncertain graph databases under probabilistic semantics
Jianzhong Li ... Hong Gao
The VLDB Journal | VOL. 21
Jianzhong Li, et. al.Jianzhong Li ... Hong Gao
28 Feb 2012
The VLDB Journal | VOL. 21

HE-Gaston algorithm for frequent subgraph mining with hadoop framework
D.B Jagannadha Rao ... Vijayakumar Polepally
Expert Systems with Applications | VOL. 251
D.B Jagannadha Rao, et. al.D.B Jagannadha Rao ... Vijayakumar Polepally
19 Apr 2024
Expert Systems with Applications | VOL. 251

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discovery of Functional Motifs from the Interface Region of Oligomeric Proteins Using Frequent Subgraph Mining.

Abstract

Talk to us

Similar Papers

More From: IEEE/ACM Transactions on Computational Biology and Bioinformatics