Cross-language information retrieval for poetry form of literature-based on machine transliteration using CNN

Ranjana S Jadhav,Manikrao Laxmanrao Dhore

doi:10.3233/jifs-223591

Abstract

Transliteration is phonetically translating a language’s words into an international or non-native screenplay. The machine translation process now plays an essential role in scholarly research. The most crucial complement criterion of the English translation system is preserving the phonetic qualities of the language specification after English translation in the chosen language. However, a suitable bilingual text corpus is necessary for statistical models to attain improved transliteration accuracy. Marathi-to-English direct machine translation is done through a cross-language information retrieval system using the CNN classifier model in this proposed research. The proposed method considers a sequence labelling issue brought on by the split transliteration units used in the process. All half-consonant clusters in the Devanagari script are effectively mapped as half-consonant “a” s and labelled using the Modified Intermediate Phonetic Code (MIPC). After generating the phonetic units for each feature in the base and aim languages, the weight is assigned to a phonetic unit in both languages, and individual phonetic unit probabilities are computed. If the probability is zero, then segments are established and recalculated for each segment based on the target phonetic unit location in the word. Therefore, the proposed approach classifies the required phonetic unit with a high accuracy rate.

Full Text