Improving the Computational Morphological Analysis of a Swahili Corpus for Lexicographic Purposes

G De Pauw,G-M De Schryver

doi:10.4314/lex.v18i1.47257

Abstract

Computational morphological analysis is an important first step in the automatic treatment of natural language and a useful lexicographic tool. This article describes a corpus-based approach to the morphological analysis of Swahili. We particularly focus our discussion on its ability to retrieve lemmas for word forms and evaluate it as a tool for corpus-based dictionary compilation. Keywords: LEXICOGRAPHY, MORPHOLOGY, CORPUS ANNOTATION, LEMMATIZATION, MACHINE LEARNING, SWAHILI (KISWAHILI)

Highlights

Samenvatting: Accuratere computationele morfologische analyse van een Swahili corpus voor lexicografische doeleinden
In De Schryver and De Pauw (2007) it was shown how the fields of natural language processing (NLP) and lexicography can collaborate towards enhancing the functionality of a corpus query package (CQP), by integrating a fast and accurate data-driven part-ofspeech (POS) tagger
We investigate how another typical NLP component — namely morphological analysis — can be developed with a minimal amount of manual effort, and demonstrate how it can be used as a CQP component

Summary

Introduction

Samenvatting: Accuratere computationele morfologische analyse van een Swahili corpus voor lexicografische doeleinden. Through this operation we can automatically induce a morphologically segmented surface and lexical representation of the word form, in which we distinguish a prefix group ([P]), the root morpheme ([R]) and a suffix group ([S]).

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Lexikos	Publication Date: Oct 27, 2009
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

Improving the Computational Morphological Analysis of a Swahili Corpus for Lexicographic Purposes

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Lexikos

Lead the way for us

Similar Papers

Towards Zulu corpus clean-up, lexicon development and corpus annotation by means of computational morphological analysis
Sonja Bosch ... Laurette Pretorius
South African Journal of African Languages | VOL. 31
Sonja Bosch, et. al.Sonja Bosch ... Laurette Pretorius
01 Jan 2010
South African Journal of African Languages | VOL. 31

The significance of computational morphological for Zulu lexicography
Sonja E Bosch ... Laurette Pretorius
South African Journal of African Languages | VOL. 22
Sonja E Bosch, et. al.Sonja E Bosch ... Laurette Pretorius
01 Jan 2002
South African Journal of African Languages | VOL. 22

A Computational Analysis of Arabic Noun Morphology
Hala Mohamed Osman Salih ... Malladi Revathi Devi
International Journal of Linguistics, Literature and Translation | VOL. 6
Hala Mohamed Osman Salih, et. al.Hala Mohamed Osman Salih ... Malladi Revathi Devi
11 Mar 2023
International Journal of Linguistics, Literature and Translation | VOL. 6

Insights into the pathogenesis of cerebral fusiform aneurysms: high-resolution MRI and computational analysis
Ryan Phillip Sabotin ... Adam E Galloy
Journal of NeuroInterventional Surgery | VOL. 13
Ryan Phillip Sabotin, et. al.Ryan Phillip Sabotin ... Adam E Galloy
25 Feb 2021
Journal of NeuroInterventional Surgery | VOL. 13

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Improving the Computational Morphological Analysis of a Swahili Corpus for Lexicographic Purposes

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Lexikos