Learning syntax by automata induction

Robert C Berwick,Sam Pilato

doi:10.1007/bf00058753

Abstract

In this paper we propose an explicit computer model for learning natural language syntax based on Angluin's (1982) efficient induction algorithms, using a complete corpus of grammatical example sentences. We use these results to show how inductive inference methods may be applied to learn substantial, coherent subparts of at least one natural language — English — that are not susceptible to the kinds of learning envisioned in linguistic theory. As two concrete case studies, we show how to learn English auxiliary verb sequences (such as could be taking, will have been taking) and the sequences of articles and adjectives that appear before noun phrases (such as the very old big deer). Both systems can be acquired in a computationally feasible amount of time using either positive examples, or, in an incremental mode, with implicit negative examples (examples outside a finite corpus are considered to be negative examples). As far as we know, this is the first computer procedure that learns a full-scale range of noun subclasses and noun phrase structure. The generalizations and the time required for acquisition match our knowledge of child language acquisition for these two cases. More importantly, these results show that just where linguistic theories admit to highly irregular subportions, we can apply efficient automata-theoretic learning algorithms. Since the algorithm works only for fragments of language syntax, we do not believe that it suffices for all of language acquisition. Rather, we would claim that language acquisition is nonuniform and susceptible to a variety of acquisition strategies; this algorithm may be one these.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Learning syntax by automata induction

Abstract

Talk to us

Similar Papers

More From: Machine Learning

Lead the way for us

Journal: Machine Learning	Publication Date: Jan 1, 1987
Citations: 21

Similar Papers

Learning Syntax by Automata Induction
Robert C Berwick ... Sam Pilato
Machine Learning | VOL. 2
Robert C Berwick, et. al.Robert C Berwick ... Sam Pilato
01 Jan 1987
Machine Learning | VOL. 2

Generalization with Precision: The Role of Negative Teaching Examples in the Instruction of Generalized Grocery Item Selection
Robert H Horner ... Richard W Albin
Journal of the Association for Persons with Severe Handicaps | VOL. 11
Robert H Horner, et. al.Robert H Horner ... Richard W Albin
01 Dec 1986
Journal of the Association for Persons with Severe Handicaps | VOL. 11

Clustering-based Method for Positive and Unlabeled Text Categorization Enhanced by Improved TFIDF
...
Journal of Information Science and Engineering | VOL. 30
, et. al. ...
01 Sep 2014
Journal of Information Science and Engineering | VOL. 30

<title>Relevance Feedback in Image Retrieval: A New Approach using Positive and Negative Examples</title>
Mohammed L Kherfi ... Alan Bernardi
-
Mohammed L Kherfi, et. al.Mohammed L Kherfi ... Alan Bernardi
20 Jan 2003
20 Jan 2003

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Learning syntax by automata induction

Abstract

Talk to us

Similar Papers

More From: Machine Learning