Kernel methods for learning languages

Leonid (Aryeh) Kontorovich, Corinna Cortes, Mehryar Mohri

doi:10.1016/j.tcs.2008.06.037

Abstract

This paper studies a novel paradigm for learning formal languages from positive and negative examples which consists of mapping strings to an appropriate high-dimensional feature space and learning a separating hyperplane in that space. Such mappings can often be represented flexibly with string kernels, with the additional benefit of computational efficiency. The paradigm inspected can thus be viewed as that of using kernel methods for learning languages. We initiate the study of the linear separability of automata and languages by examining the rich class of piecewise-testable languages. We introduce a subsequence feature mapping to a Hilbert space and prove that piecewise-testable languages are linearly separable in that space. The proof makes use of word combinatorial results relating to subsequences. We also show that the positive definite symmetric kernel associated to this embedding is a rational kernel and show that it can be computed in quadratic time using general-purpose weighted automata algorithms. Our examination of the linear separability of piecewise-testable languages leads us to study the general problem of separability with other finite regular covers. We show that all languages linearly separable under a regular finite cover embedding, a generalization of the subsequence embedding we use, are regular. We give a general analysis of the use of support vector machines in combination with kernels to determine a separating hyperplane for languages and study the corresponding learning guarantees. Our analysis includes several additional linear separability results in abstract settings and partial characterizations for the linear separability of the family of all regular languages.
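As a rough illustration of the subsequence embedding described above, the sketch below assumes the feature map sends a string x to a 0/1 vector indexed by all strings u, with a 1 wherever u is a subsequence of x. Under that assumption, the inner product of two embedded strings is the number of distinct subsequences they share (the empty string included). The function names are hypothetical, and the dynamic program is a stand-in for, not a reproduction of, the weighted-automata algorithm the abstract refers to; it runs in time proportional to |x|·|y| times the alphabet size.

```python
from functools import lru_cache
from itertools import combinations


def subsequence_kernel(x: str, y: str) -> int:
    """Count distinct common subsequences of x and y (empty string included).

    Illustrative sketch only: assumes a 0/1 subsequence feature map, so the
    kernel value equals the number of shared distinct subsequences.
    """
    alphabet = set(x) & set(y)

    def next_table(s: str):
        # nxt[i][c] = smallest index k >= i with s[k] == c (absent if none)
        nxt = [dict() for _ in range(len(s) + 1)]
        for i in range(len(s) - 1, -1, -1):
            nxt[i] = dict(nxt[i + 1])
            nxt[i][s[i]] = i
        return nxt

    nx, ny = next_table(x), next_table(y)

    @lru_cache(maxsize=None)
    def count(i: int, j: int) -> int:
        # Common subsequences of x[i:] and y[j:]; start with the empty string.
        total = 1
        for c in alphabet:
            a, b = nx[i].get(c), ny[j].get(c)
            if a is not None and b is not None:
                # Every common subsequence starting with c can use the first
                # occurrence of c in both strings, so each is counted once.
                total += count(a + 1, b + 1)
        return total

    return count(0, 0)


def subsequences(s: str) -> set:
    """Brute-force set of all distinct subsequences of s (exponential)."""
    return {
        "".join(s[i] for i in idx)
        for r in range(len(s) + 1)
        for idx in combinations(range(len(s)), r)
    }


def brute_force_kernel(x: str, y: str) -> int:
    # Inner product of the two explicit 0/1 feature vectors.
    return len(subsequences(x) & subsequences(y))
```

The brute-force version builds the feature vectors explicitly and is only practical for short strings; it is included so the dynamic program can be checked against the definition. The recursive version relies on Python's call stack, so very long inputs would need an iterative rewrite.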


Journal: Theoretical Computer Science
Publication Date: Jun 25, 2008
Citations: 46
License type: publisher-specific-oa

Similar Papers

Learning Linearly Separable Languages
Leonid Kontorovich ... Mehryar Mohri
01 Jan 2006

Learning Languages with Rational Kernels
Corinna Cortes ... Mehryar Mohri
01 Jan 2007

Overlapping Patterns Recognition with Linear and Non-Linear Separations using Positive Definite Kernels
Chiheb-Eddine Benn'Cir ... Nadia Essoussi
International Journal of Computer Applications | VOL. 56
20 Oct 2012

Visualisation and interpretation of Support Vector Regression models
B Üstün ... L.M.C Buydens
Analytica Chimica Acta | VOL. 595
18 Mar 2007
