Abstract

Convolution kernels, such as sequence and tree kernels, are advantageous in terms of both concept and accuracy for many natural language processing (NLP) tasks. Experiments have shown, however, that over-fitting often arises when these kernels are applied to NLP tasks. This paper analyzes this issue of convolution kernels and then proposes a new approach, based on statistical feature selection, that avoids it. To allow the proposed method to be executed efficiently, the feature selection is embedded directly into the kernel calculation process by means of sub-structure mining algorithms. Experiments on real NLP tasks confirm the problem with a conventional method and compare its performance with that of the proposed method.
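The abstract refers to convolution kernels only in the abstract sense; as a concrete illustration (not the paper's actual method), the sketch below shows one of the simplest members of this family: a substring kernel that computes an inner product in the feature space indexed by all contiguous substrings of the inputs. The function names (substrings, substring_kernel) are hypothetical and chosen for this example.

```python
from collections import Counter

def substrings(s):
    """Return every contiguous substring of s with its multiplicity."""
    return Counter(s[i:j] for i in range(len(s))
                          for j in range(i + 1, len(s) + 1))

def substring_kernel(s, t):
    """Convolution-style kernel: K(s, t) = sum over shared substrings u
    of count_s(u) * count_t(u), i.e. an inner product in the feature
    space indexed by all substrings of the inputs."""
    cs, ct = substrings(s), substrings(t)
    return sum(n * ct[u] for u, n in cs.items() if u in ct)

# Example: "abab" and "bab" share the substrings a, b, ab, ba, bab.
print(substring_kernel("abab", "bab"))  # -> 10
```

Even in this toy case the feature space grows quadratically with input length (and exponentially for subsequence or tree-fragment kernels), which is why a feature selection step embedded in the kernel computation, as the paper proposes, is attractive.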
