Abstract

We propose the task of unsupervised morphological paradigm completion. Given only raw text and a lemma list, the task consists of generating the morphological paradigms, i.e., all inflected forms, of the lemmas. From a natural language processing (NLP) perspective, this is a challenging unsupervised task, and high-performing systems have the potential to improve tools for low-resource languages or to assist linguistic annotators. From a cognitive science perspective, this can shed light on how children acquire morphological knowledge. We further introduce a system for the task, which generates morphological paradigms via the following steps: (i) EDIT TREE retrieval, (ii) additional lemma retrieval, (iii) paradigm size discovery, and (iv) inflection generation. We perform an evaluation on 14 typologically diverse languages. Our system outperforms trivial baselines with ease and, for some languages, even obtains a higher accuracy than minimally supervised systems.

Highlights

  • Morphologically rich languages express syntactic and semantic properties of words, such as tense or case, through inflection, i.e., changes to the surface forms of the words

  • We proposed unsupervised morphological paradigm completion, a novel morphological generation task

  • We further developed a system for the task, which performs the following steps: (i) EDIT TREE retrieval, (ii) additional lemma retrieval, (iii) paradigm size discovery, and (iv) inflection generation
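To make step (i) more concrete, the following is a minimal Python sketch of an edit tree as the structure is commonly defined: the longest common substring of a lemma and an inflected form is kept, and the material to its left and right is handled recursively until only plain substitutions remain. It is not the authors' implementation. The names `Replace`, `Split`, `build_edit_tree`, and `apply_edit_tree` are hypothetical, and the use of `difflib.SequenceMatcher` to find the longest common substring is an assumption made for brevity.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import Optional, Union


@dataclass(frozen=True)
class Replace:
    """Leaf node: substitute the exact string `old` with `new`."""
    old: str
    new: str


@dataclass(frozen=True)
class Split:
    """Inner node: edit the first `prefix_len` and last `suffix_len` characters."""
    prefix_len: int
    suffix_len: int
    left: "EditTree"
    right: "EditTree"


EditTree = Union[Replace, Split]


def build_edit_tree(source: str, target: str) -> EditTree:
    """Derive an edit tree that transforms `source` into `target`."""
    matcher = SequenceMatcher(None, source, target, autojunk=False)
    match = matcher.find_longest_match(0, len(source), 0, len(target))
    if match.size == 0:
        # Nothing in common: fall back to a plain substitution.
        return Replace(source, target)
    src_end = match.a + match.size
    tgt_end = match.b + match.size
    left = build_edit_tree(source[:match.a], target[:match.b])
    right = build_edit_tree(source[src_end:], target[tgt_end:])
    return Split(match.a, len(source) - src_end, left, right)


def apply_edit_tree(tree: EditTree, word: str) -> Optional[str]:
    """Apply `tree` to `word`; return None if the tree does not fit the word."""
    if isinstance(tree, Replace):
        return tree.new if word == tree.old else None
    if len(word) < tree.prefix_len + tree.suffix_len:
        return None
    end = len(word) - tree.suffix_len
    left = apply_edit_tree(tree.left, word[:tree.prefix_len])
    right = apply_edit_tree(tree.right, word[end:])
    if left is None or right is None:
        return None
    # The middle of the word is copied unchanged.
    return left + word[tree.prefix_len:end] + right


if __name__ == "__main__":
    tree = build_edit_tree("hablar", "hablamos")
    print(apply_edit_tree(tree, "cantar"))
    print(apply_edit_tree(tree, "casa"))
```

Because the tree stores only lengths and substitutions rather than the shared stem, a tree learned from one lemma and form pair can be reused on other lemmas that inflect the same way, and it returns `None` for words it does not fit.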

Summary

Introduction

Morphologically rich languages express syntactic and semantic properties of words, such as tense or case, through inflection, i.e., changes to the surface forms of the words. Such languages pose a challenge for natural language processing (NLP) systems: because each lemma can take on a variety of surface forms, the frequency of each individual inflected word decreases drastically.
