Converting System of Phonetics Transcriptions to Myanmar Text Using N-grams Language Models

Kyaw Kyaw Maung

doi:10.32628/ijsrset151356

Abstract

Converting between Phonetics transcriptions and Myanmar text is a process of converting between the sequence of Phonetics transcriptions and Myanmar text. Phonetics transcription is based on the pronunciation of the language and the Myanmar text is based on the written language. One Phonetics alphabet can be represented many possible forms in written language that leads into word sense ambiguity problem. Another problem is that both of the Phonetics transcriptions and Myanmar text have no space to identify the boundary of syllables and words. This problem can be defined as segmentation problem for matching and mapping between Phonetics transcriptions and Myanmar text. To solve the word-sense ambiguity problem, the research developed n-grams language models from correct training data in Myanmar language. By using these trained n-grams language models, the system can be converted from Phonetics to Myanmar text. Instead of computing the probability on the trained n-grams data, the system matched the input data and the trained n-grams model data. The system has built n-grams models where unigram model, bi-grams model, trigrams model, 4-grams models and 5-grams models to train and convert between Phonetics and Myanmar text. To solve the segmentation problem, the system needed to break the input text into individual tokens. In the system, each token may be represented the consonant, or consonant clusters or vowels. To segment the input text Myanmar text or Phonetics transcriptions correctly, the proposed used the Unicode fonts for both Myanmar text and Phonetics transcriptions.

Full Text

Published version (

Free)

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Converting System of Phonetics Transcriptions to Myanmar Text Using N-grams Language Models

Abstract

Talk to us

Similar Papers

More From: International journal of scientific research in science, engineering and technology

Lead the way for us

Journal: International journal of scientific research in science, engineering and technology	Publication Date: Jun 13, 2015
Citations: 2

Similar Papers

Influence of language models and candidate set size on contextual post-processing for Chinese script recognition
...
-
, et. al. ...
23 Aug 2004
23 Aug 2004

Federated Learning of N-Gram Language Models
Adeline Wong ... Ananda Theertha Suresh
-
Adeline Wong, et. al.Adeline Wong ... Ananda Theertha Suresh
01 Jan 2019
01 Jan 2019

Joint unsupervised adaptation of n-gram and RNN language models via LDA-based hybrid mixture modeling
Yushi Aono ... Ryo Masumura
-
Yushi Aono, et. al.Yushi Aono ... Ryo Masumura
01 Dec 2017
01 Dec 2017

An Empirical Comparison Between N-gram and Syntactic Language Models for Word Ordering
Yue Zhang ... Jiangming Liu
-
Yue Zhang, et. al.Yue Zhang ... Jiangming Liu
01 Jan 2015
01 Jan 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Converting System of Phonetics Transcriptions to Myanmar Text Using N-grams Language Models

Abstract

Talk to us

Similar Papers

More From: International journal of scientific research in science, engineering and technology