LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task.

Neha Warikoo, Yung-Chun Chang, Wen-Lian Hsu

doi:10.1093/database/bay108

Abstract

Identifying the interactions between chemical compounds and genes from biomedical literature is one of the frequently discussed topics of text mining in the life sciences. In this paper, we describe the Linguistic Pattern-Aware Dependency Tree Kernel (LPTK), a linguistic interaction pattern learning method developed for the BioCreative VI CHEMPROT task, to capture chemical–protein interaction (CPI) patterns within biomedical literature. We also introduce a framework that integrates these linguistic patterns with a smooth partial tree kernel to extract the CPIs. This new method of feature representation models aspects of linguistic probability in a geometric representation, which not only optimizes the sufficiency of the feature dimension for classification, but also defines features as interpretable contexts rather than long vectors of numbers. To test the robustness and efficiency of our system in identifying different kinds of biological interactions, we evaluated our framework on three separate data sets: the CHEMPROT corpus, the Chemical–Disease Relation corpus and the Protein–Protein Interaction corpus. The experimental results demonstrate that our method is effective and outperforms several compared systems on each data set.

Highlights

Increasing digitization of knowledge over the past decade has produced a vast pool of information that can be tapped to infer associations between entities; these entity associations can be quantified and analyzed for varied purposes
We participated in the BioCreative VI chemical–protein interaction (CPI) task and developed a Linguistic Pattern-Aware Dependency Tree Kernel (LPTK) model for studying the bio-entity association types mentioned between chemicals and proteins
Relation pairs in each test case are based on the combination of all pre-annotated entity pairs described for each instance of the interaction class
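
The pairing step described in the last highlight can be illustrated with a minimal sketch. It assumes that "combination of all pre-annotated entity pairs" means pairing every annotated chemical with every annotated gene/protein in the same text unit; the entity dictionaries, the type labels `CHEMICAL` and `GENE`, and the function name `candidate_pairs` are hypothetical and are not taken from the paper.

```python
from itertools import product

def candidate_pairs(entities):
    """Pair every annotated chemical with every annotated gene/protein.

    Assumption: candidates are the cross product of the two entity types;
    the paper's exact pairing rule may differ.
    """
    chemicals = [e for e in entities if e["type"] == "CHEMICAL"]
    genes = [e for e in entities if e["type"] == "GENE"]
    return [(c["id"], g["id"]) for c, g in product(chemicals, genes)]

# Hypothetical annotations for one sentence (identifiers are made up).
entities = [
    {"id": "T1", "type": "CHEMICAL", "text": "aspirin"},
    {"id": "T2", "type": "GENE", "text": "COX-1"},
    {"id": "T3", "type": "GENE", "text": "COX-2"},
]
pairs = candidate_pairs(entities)
```

Each resulting pair would then be passed to a classifier that assigns an interaction class or rejects the pair; that classification step is not shown here.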

Summary

Introduction

Increasing digitization of knowledge over the past decade has produced a vast pool of information that can be tapped to infer associations between entities; these entity associations can be quantified and analyzed for varied purposes. The pinnacle of such text analysis and information identification hinges on ‘relation’. The CPI task is a text mining-based task in which PubMed abstracts are studied to identify the nature of the different interaction types triggered by chemical compounds/drugs interacting with genes/proteins. The task of extracting CPIs has potential implications for automating and upgrading the way precision medicine is conducted.

Journal: Database
Publication Date: Jan 1, 2018
Citations: 22
License type: CC BY 4.0
