Contextual Analysis for Middle Eastern Languages with Hidden Markov Models

Kazem Taghva

doi:10.5121/ijnlc.2015.4401

Abstract

Displaying a document in Middle Eastern languages requires contextual analysis due to different presentational forms for each character of the alphabet. The words of the document will be formed by the joining of the correct positional glyphs representing corresponding presentational forms of the characters. A set of rules defines the joining of the glyphs. As usual, these rules vary from language to language and are subject to interpretation by the software developers. In this paper, we propose a machine learning approach for contextual analysis based on the first order Hidden Markov Model. We will design and build a model for the Farsi language to exhibit this technology. The Farsi model achieves 94% accuracy with the training based on a short list of 89 Farsi vocabularies consisting of 2780 Farsi characters. The experiment can be easily extended to many languages including Arabic, Urdu, and Sindhi. Furthermore, the advantage of this approach is that the same software can be used to perform contextual analysis without coding complex rules for each specific language. Of particular interest is that the languages with fewer speakers can have greater representation on the web, since they are typically ignored by software developers due to lack of financial incentives.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Contextual Analysis for Middle Eastern Languages with Hidden Markov Models

Abstract

Talk to us

Similar Papers

More From: International Journal on Natural Language Computing

Lead the way for us

Journal: International Journal on Natural Language Computing	Publication Date: Aug 30, 2015
Citations: 1

Similar Papers

Algorithms for high order hidden Markov modelling
J.A Du Preez
-
J.A Du PreezJ.A Du Preez
09 Sep 1997
09 Sep 1997

Basic problems and solution methods for two-dimensional continuous 3 × 3 order hidden Markov model
Guo-Gang Wang ... Xiu-Chang Zhu
Chaos, Solitons and Fractals: the interdisciplinary journal of Nonlinear Science, and Nonequilibrium and Complex Phenomena | VOL. 89
Guo-Gang Wang, et. al.Guo-Gang Wang ... Xiu-Chang Zhu
04 Mar 2016
Chaos, Solitons and Fractals: the interdisciplinary journal of Nonlinear Science, and Nonequilibrium and Complex Phenomena | VOL. 89

Tutorial on Hidden Markov Model

Applied and Computational Mathematics | VOL. 6

17 Jun 2016
Applied and Computational Mathematics | VOL. 6

A Better Method for Length Distribution Modeling in HMMs and Its Application to Gene Finding
Broňa Brejová ... Tomáš Vinař
-
Broňa Brejová, et. al.Broňa Brejová ... Tomáš Vinař
01 Jan 2002
01 Jan 2002

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Contextual Analysis for Middle Eastern Languages with Hidden Markov Models

Abstract

Talk to us

Similar Papers

More From: International Journal on Natural Language Computing