English for the Computer: The SUSANNE Corpus and Analytic Scheme

Geoffrey Sampson

doi:10.1162/089120102317341800

Abstract

Computer processing of natural language is a burgeoning field, but until now there has been no agreement on a standardized classification of the diverse structural elements that occur in real-life language material. This book attempts to define a Linnaean taxonomy for the English language: an annotation scheme, the SUSANNE scheme, which yields a labelled constituency structure for any string of English, comprehensively identifying all of its surface and logical structural properties. The structure is specified with sufficient rigour that analysts working independently must produce identical annotations for a given example. The scheme is based on large sample of real-life use of British and American written and spoken English. The book also describes the SUSANNE electronic corpus of English which is annotated in accordance with the scheme. It is freely available as a research resource to anyone working at a computer conected to Internet, and since 1992 has come into widespread use in academic and commerical research environments on four continents.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

English for the Computer: The SUSANNE Corpus and Analytic Scheme

Abstract

Talk to us

Similar Papers

More From: Computational Linguistics

Lead the way for us

Journal: Computational Linguistics	Publication Date: Mar 1, 2002
Citations: 127

Similar Papers

English for the Computer
Geoffrey Sampson
-
Geoffrey SampsonGeoffrey Sampson
23 Feb 1995
23 Feb 1995

Guest Editors Introduction: Machine Learning in Speech and Language Technologies
Pascale Fung ... Dan Roth
Machine Learning | VOL. 60
Pascale Fung, et. al.Pascale Fung ... Dan Roth
01 Sep 2005
Machine Learning | VOL. 60

Natural Language Processing and Computational Linguistics
Junichi Tsujii
Computational Linguistics | VOL. -
Junichi TsujiiJunichi Tsujii
07 Dec 2021
Computational Linguistics | VOL. -

Examining the Dimensions of Adopting Natural Language Processing and Big Data Analytics Applications in Firms
Sheshadri Chatterjee ... Patrick Mikalef
IEEE Transactions on Engineering Management | VOL. 71
Sheshadri Chatterjee, et. al.Sheshadri Chatterjee ... Patrick Mikalef
01 Jan 2024
IEEE Transactions on Engineering Management | VOL. 71

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

English for the Computer: The SUSANNE Corpus and Analytic Scheme

Abstract

Talk to us

Similar Papers

More From: Computational Linguistics