Utilizing Language Technology in the Documentation of Endangered Uralic Languages

Ciprian Gerstenberger,Michael Rießler,Joshua Wilbur,Niko Partanen

doi:10.3384/nejlt.2000-1533.1643

Abstract

The paper describes work-in-progress by the Pite Saami, Kola Saami and Izhva Komi language documentation projects, all of which record new spoken language data, digitize available recordings and annotate these multimedia data in order to provide comprehensive language corpora as databases for future research on and for endangered – and under-described – Uralic speech communities. Applying language technology in language documentation helps us to create more systematically annotated corpora, rather than eclectic data collections. Specifically, we describe a script providing interactivity between different morphosyntactic analysis modules implemented as Finite State Transducers and ELAN, a Graphical User Interface tool for annotating and presenting multimodal corpora. Ultimately, the spoken corpora created in our projects will be useful for scientifically significant quantitative investigations on these languages in the future.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Northern European Journal of Language Technology	Publication Date: Mar 13, 2016
Citations: 9	License type: CC BY-NC 4.0

R Discovery Prime

R Discovery Prime

Utilizing Language Technology in the Documentation of Endangered Uralic Languages

Abstract

Talk to us

Similar Papers

More From: Northern European Journal of Language Technology

Lead the way for us

Similar Papers

Impact of Language Documentation
Shobhana L Chelliah
-
Shobhana L ChelliahShobhana L Chelliah
01 Jan 2020
01 Jan 2020

Finite state prosodic analysis of african corpus resources
Dafydd Gibbon
-
Dafydd GibbonDafydd Gibbon
03 Sep 2001
03 Sep 2001

Modeling the Noun Morphology of Plains Cree
Conor Snoek ... Sjur Moshagen
-
Conor Snoek, et. al.Conor Snoek ... Sjur Moshagen
01 Jan 2014
01 Jan 2014

Developing without Developers: Choosing Labor-saving Tools for Language Documentation Apps
Luke D Gessler
Proceedings of the Workshop on Computational Methods for Endangered Languages | VOL. 1
Luke D GesslerLuke D Gessler
01 Jan 2019
Proceedings of the Workshop on Computational Methods for Endangered Languages | VOL. 1

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Utilizing Language Technology in the Documentation of Endangered Uralic Languages

Abstract

Talk to us

Similar Papers

More From: Northern European Journal of Language Technology