Parsing, projecting & prototypes

William D Lewis, Fei Xia

doi:10.3115/1609049.1609060

Abstract

Until very recently, most NLP tasks (e.g., parsing and tagging) have been confined to a very limited number of languages, the so-called majority languages. Now, as the field moves into the era of developing tools for Resource Poor Languages (RPLs)--a vast majority of the world's 7,000 languages are resource poor--the discipline is confronted not only with the algorithmic challenges of limited data, but also with the sheer difficulty of locating data in the first place. In this demo, we present a resource which taps the large body of linguistically annotated data on the Web, data which can be repurposed for NLP tasks. Because the field of linguistics has as its mandate the study of human language--in fact, the study of all human languages--and has whole-heartedly embraced the Web as a means for disseminating linguistic knowledge, a large quantity of analyzed language data can be found on the Web. In many cases, the data is richly annotated and exists for many languages for which there would otherwise be very limited annotated data. The resource, the Online Database of INterlinear text (ODIN), makes this data available and provides additional annotation and structure, making the resource useful to the computational linguistics audience.
