Automatic Language Identification

Nejla Qafmolla

doi:10.26417/ejls.v7i1.p140-150

Abstract

Automatic Language Identification (LID) is the process of automatically identifying the language of spoken utterance or written material. LID has received much attention due to its application to major areas of research and long-aspired dreams in computational sciences, namely Machine Translation (MT), Speech Recognition (SR) and Data Mining (DM). A considerable increase in the amount of and access to data provided not only by experts but also by users all over the Internet has resulted into both the development of different approaches in the area of LID – so as to generate more efficient systems – as well as major challenges that are still in the eye of the storm of this field. Despite the fact that the current approaches have accomplished considerable success, future research concerning some issues remains on the table. The aim of this paper shall not be to describe the historic background of this field of studies, but rather to provide an overview of the current state of LID systems, as well as to classify the approaches developed to accomplish them. LID systems have advanced and are continuously evolving. Some of the issues that need special attention and improvement are semantics, the identification of various dialects and varieties of a language, identification of spelling errors, data retrieval, multilingual documents, MT and speech-to-speech translation. Methods applied to date have been good from a technical point of view, but not from a semantic one.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: European Journal of Language and Literature	Publication Date: Jan 21, 2017
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

Automatic Language Identification

Abstract

Talk to us

Similar Papers

More From: European Journal of Language and Literature

Lead the way for us

Similar Papers

Automatic language identification and discrimination using the modified group delay feature
R.M Hegde ... H.A Murthy
-
R.M Hegde, et. al.R.M Hegde ... H.A Murthy
14 Nov 2005
14 Nov 2005

Evaluating the Effect of Multiple Filters in Automatic Language Identification without Lexical Knowledge
Guan-Lip Soon ... Nur-Hana Samsudin
International Journal of Advanced Computer Science and Applications | VOL. 11
Guan-Lip Soon, et. al.Guan-Lip Soon ... Nur-Hana Samsudin
01 Jan 2020
International Journal of Advanced Computer Science and Applications | VOL. 11

Unsupervised Deep Language and Dialect Identification for Short Texts
Koustava Goswami ... Theodorus Fransen
-
Koustava Goswami, et. al.Koustava Goswami ... Theodorus Fransen
01 Jan 2020
01 Jan 2020

A comparison between phonetic engine and GMM–UBM classifier for language identification tasks
Sushanta Kabir Dutta ... Tanvira Ismail
Microsystem Technologies | VOL. 28
Sushanta Kabir Dutta, et. al.Sushanta Kabir Dutta ... Tanvira Ismail
06 May 2020
Microsystem Technologies | VOL. 28

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic Language Identification

Abstract

Talk to us

Similar Papers

More From: European Journal of Language and Literature