Automatically Extracting Typical Syntactic Differences from Corpora

W Wiersma,T Lauttamus,J Nerbonne

doi:10.1093/llc/fqq017

Abstract

We develop an aggregate measure of syntactic difference for automatically find- ing common syntactic differences between collections of text. With the use of this measure, it is possible to mine for differences between, for example, the English of learners and natives, or between related dialects. If formulated in advance, hypotheses can also be tested for statistical significance. It enables us to find not only absence or presence, but also under- and overuse of specific constructs. We have applied our measure to the English of Finnish immigrants in Australia to look for traces of Finnish grammar in their English. The outcomes of this de- tection process were analysed and found to be insightful. A report is included in this article. Besides explaining our method, we also go into the theory behind it, including permutation statistics, and the custom normalizations required for applying these tests to syntactical data. We also explain how to use the software we developed to apply this method to new corpora, and give some suggestions for further research.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Automatically Extracting Typical Syntactic Differences from Corpora

Abstract

Talk to us

Similar Papers

More From: Literary and Linguistic Computing

Lead the way for us

Journal: Literary and Linguistic Computing	Publication Date: Oct 11, 2010
Citations: 40

Similar Papers

The Study of English Learning Community in Network Based on SNS
Meijing Li
-
Meijing LiMeijing Li
01 Jan 2015
01 Jan 2015

Vowel Sound Symbols and Schools of Transcriptions
Majid Mohammed Saadoon
Communication and Linguistics Studies | VOL. 4
Majid Mohammed SaadoonMajid Mohammed Saadoon
01 Jan 2018
Communication and Linguistics Studies | VOL. 4

Second language anxiety among Latino American immigrants in Australia
Marta Garcia De Blakeley ... Leanne Casey
International Journal of Bilingual Education and Bilingualism | VOL. 20
Marta Garcia De Blakeley, et. al.Marta Garcia De Blakeley ... Leanne Casey
21 Sep 2015
International Journal of Bilingual Education and Bilingualism | VOL. 20

INVESTIGATING GENDER INFLUENCE ON LANGUAGE LEARNING BELIEFS
...
European Journal of Education Studies | VOL. -
, et. al. ...
25 Oct 2017
European Journal of Education Studies | VOL. -

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatically Extracting Typical Syntactic Differences from Corpora

Abstract

Talk to us

Similar Papers

More From: Literary and Linguistic Computing