Discovering Patterns From Sequences Using Pattern-Directed Aligned Pattern Clustering.

Antonio Sze-To,Andrew K C Wong

doi:10.1109/tnb.2018.2845741

Abstract

Functional region identification is of fundamental importance for protein sequences analysis. Such knowledge provides better scientific understanding and could assist drug discovery. Up-to-date, domain annotation is one approach, but it needs to leverage existing databases. For de novo discovery, motif discovery locates and aligns locally homologous sub-sequences to obtain a position-weight matrix (PWM), which is a fixed-length representation model, whereas protein functional region size varies. It thus requires computational expensive exhaustive search to obtain a PWM with width of optimal range. This paper presents a new method known as pattern-directed aligned pattern clustering (PD-APCn) to discover and align patterns in conserved protein functional regions. It adopts aligned pattern cluster (APC) with patterns of variable length and strong support to direct the incremental APC expansion. It allows substitution and frame-shift mutations until a robust termination condition is reached. The concept of breakpoint gap is introduced to identify spots of mutations, such as substitution and frame shifts. Experiments on synthetic data sets with different sizes and noise levels showed that PD-APCn outperforms MEME with much higher recall and Fmeasure and computational speed 665 times faster that MEME. When applying to Cytochrome C and Ubiquitin families, it found all key binding sites within the APCs.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Discovering Patterns From Sequences Using Pattern-Directed Aligned Pattern Clustering.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on nanobioscience

Lead the way for us

Journal: IEEE transactions on nanobioscience	Publication Date: Jun 8, 2018
Citations: 26

Similar Papers

Pattern-directed aligned pattern clustering
Antonio Sze-To ... Andrew K C Wong
-
Antonio Sze-To, et. al.Antonio Sze-To ... Andrew K C Wong
01 Nov 2017
01 Nov 2017

ConSurf: identification of functional regions in proteins by surface-mapping of phylogenetic information.
Fabian Glaser ... Rachel E Bell
Bioinformatics | VOL. 19
Fabian Glaser, et. al.Fabian Glaser ... Rachel E Bell
01 Jan 2003
Bioinformatics | VOL. 19

Correlation of nasal morphology to air-conditioning and clearance function
David E White ... Ahmed M Al-Jumaily
Respiratory Physiology & Neurobiology | VOL. 179
David E White, et. al.David E White ... Ahmed M Al-Jumaily
23 Jul 2011
Respiratory Physiology & Neurobiology | VOL. 179

Identification of Urban Functional Regions Based on Floating Car Track Data and POI Data
Beibei Yu ... Haowei Mu
Sustainability | VOL. 11
Beibei Yu, et. al.Beibei Yu ... Haowei Mu
20 Nov 2019
Sustainability | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Discovering Patterns From Sequences Using Pattern-Directed Aligned Pattern Clustering.

Abstract

Talk to us

Similar Papers

More From: IEEE transactions on nanobioscience