Tree Kernel Research Articles

Because adverse drug events (ADEs) are a serious health problem and a leading cause of death, it is of vital importance to identify them correctly and in a timely manner. With the development of Web 2.0, social media has become a large data source for information on ADEs. The objective of this study is to develop a relation extraction system that uses natural language processing techniques to effectively distinguish between ADEs and non-ADEs in informal text on social media. We develop a feature-based approach that utilizes various lexical, syntactic, and semantic features. Information-gain-based feature selection is performed to address high-dimensional features. Then, we evaluate the effectiveness of four well-known kernel-based approaches (i.e., subset tree kernel, tree kernel, shortest dependency path kernel, and all-paths graph kernel) and several ensembles that are generated by adopting different combination methods (i.e., majority voting, weighted averaging, and stacked generalization). All of the approaches are tested using three data sets: two health-related discussion forums and one general social networking site (i.e., Twitter). When investigating the contribution of each feature subset, the feature-based approach attains the best area under the receiver operating characteristics curve (AUC) values, which are 78.6%, 72.2%, and 79.2% on the three data sets. When individual methods are used, we attain the best AUC values of 82.1%, 73.2%, and 77.0% using the subset tree kernel, shortest dependency path kernel, and feature-based approach on the three data sets, respectively. When using classifier ensembles, we achieve the best AUC values of 84.5%, 77.3%, and 84.5% on the three data sets, outperforming the baselines. Our experimental results indicate that ADE extraction from social media can benefit from feature selection. With respect to the effectiveness of different feature subsets, lexical features and semantic features can enhance the ADE extraction capability. Kernel-based approaches, which can stay away from the feature sparsity issue, are qualified to address the ADE extraction problem. Combining different individual classifiers using suitable combination methods can further enhance the ADE extraction effectiveness.

Read full abstract

Edit distances provide us with an established method to capture structural features of data, and a distance between data objects represents their dissimilarity. In contrast, kernels form a category of similarity functions, and a positive definite kernel enables us to leverage abundant techniques of multivariate analysis. This paper aims to fill the gap between distances and kernels. In the literature, we have several formulas that convert a negative definite distance function into a positive definite kernel. Edit distance functions, however, are not necessarily negative definite, and our first contribution is to introduce an alternative method to derive positive definite kernels from edit distance functions that are not necessarily negative definite. The method is equipped with an easy-to-check and strong sufficient condition for positive definiteness, and the condition turns out to be tightly related with the triangle inequality. In fact, to our knowledge, all of the edit distance functions in the literature that support the triangle inequality meet the condition for positive definiteness. Secondly, we apply this method to four well-known edit distance functions for trees to introduce four novel kernels and show that three of them are positive definite. Thirdly, we develop a theory of subtree matching to study these kernels. Our kernels count matchings between subtrees of the input trees with weights determined according to individual matchings. Although the number of such matchings is an exponential function of the size of the input trees (the number of vertices), our theory enables us to develop dynamic-programming-based algorithms, whose asymptotic computational complexities fall between a quadratic function and a cubic function of the size.

Read full abstract

Tree Kernel Research Articles

Related Topics

Articles published on Tree Kernel

Matching parse thickets for open domain question answering

The content and emission factors of heavy metals in biomass used for energy purposes in the context of the requirements of international standards

An ensemble method for extracting adverse drug events from social media

Ordered Decompositional DAG kernels enhancements

PIPE: a protein-protein interaction passage extraction module for BioCreative challenge.

Support Vector Machine with Ensemble Tree Kernel for Relation Extraction.

Distributed Smoothed Tree Kernel

Event causality extraction based on connectives analysis

A theory of subtree matching and tree kernels based on the edit distance concept

Phylodynamic Inference with Kernel ABC and Its Application to HIV Epidemiology.

The Research of Improved Spanning Tree Kernel Algorithm for Image Classification

Multi-lingual opinion mining on YouTube

A novel approach to wavelet selection and tree kernel construction for diagnosis of rolling element bearing fault

Towards Topic-to-Question Generation

Recurrent Convolutional Neural Networks for Text Classification

Computation of Program Source Code Similarity by Composition of Parse Tree and Call Graph

Enhancing Predictive Analytics for Anti-Phishing by Exploiting Website Genre Information

Learning Structural Kernels for Natural Language Processing

An efficient topological distance-based tree kernel.

Kernel k-nearest neighbor classifier based on decision tree ensemble for SAR modeling analysis

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Tree Kernel Research Articles

Related Topics

Articles published on Tree Kernel

Matching parse thickets for open domain question answering

The content and emission factors of heavy metals in biomass used for energy purposes in the context of the requirements of international standards

An ensemble method for extracting adverse drug events from social media

Ordered Decompositional DAG kernels enhancements

PIPE: a protein-protein interaction passage extraction module for BioCreative challenge.

Support Vector Machine with Ensemble Tree Kernel for Relation Extraction.

Distributed Smoothed Tree Kernel

Event causality extraction based on connectives analysis

A theory of subtree matching and tree kernels based on the edit distance concept

Phylodynamic Inference with Kernel ABC and Its Application to HIV Epidemiology.

The Research of Improved Spanning Tree Kernel Algorithm for Image Classification

Multi-lingual opinion mining on YouTube

A novel approach to wavelet selection and tree kernel construction for diagnosis of rolling element bearing fault

Towards Topic-to-Question Generation

Recurrent Convolutional Neural Networks for Text Classification

Computation of Program Source Code Similarity by Composition of Parse Tree and Call Graph

Enhancing Predictive Analytics for Anti-Phishing by Exploiting Website Genre Information

Learning Structural Kernels for Natural Language Processing

An efficient topological distance-based tree kernel.

Kernel k-nearest neighbor classifier based on decision tree ensemble for SAR modeling analysis