Machine Translation Evaluation: Unveiling the Role of Dense Sentence Vector Embedding for Morphologically Rich Language

Samiksha Tripathi,Vineet Kansal

doi:10.1142/s0218001420590016

Samiksha Tripathi, Vineet Kansal

https://doi.org/10.1142/s0218001420590016

Copy DOI

Export

Save

Cite

Abstract
Full-Text
Similar Papers

Abstract

Listen

Machine Translation (MT) evaluation metrics like BiLingual Evaluation Understudy (BLEU) and Metric for Evaluation of Translation with Explicit Ordering (METEOR) are known to have poor performance for word-order and morphologically rich languages. Application of linguistic knowledge to evaluate MTs for morphologically rich language like Hindi as a target language, is shown to be more effective and accurate [S. Tripathi and V. Kansal, Using linguistic knowledge for machine translation evaluation with Hindi as a target language, Comput. Sist.21(4) (2017) 717–724]. Leveraging the recent progress made in the domain of word vector and sentence vector embedding [T. Mikolov and J. Dean, Distributed representations of words and phrases and their compositionality, Adv. Neural Inf. Process. Syst. 2 (2013) 3111–3119], authors have trained a large corpus of pre-processed Hindi text ([Formula: see text] million tokens) for obtaining the word vectors and sentence vector embedding for Hindi. The training has been performed on high end system configuration utilizing Google Cloud platform resources. This sentence vector embedding is further used to corroborate the findings through linguistic knowledge in evaluation metric. For morphologically rich language as target, evaluation metric of MT systems is considered as an optimal solution. In this paper, authors have demonstrated that MT evaluation using sentence embedding-based approach closely mirrors linguistic evaluation technique. The relevant codes used to generate the vector embedding for Hindi have been uploaded on code sharing platform Github. a

Full Text

Published Version

Check institute access

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

Machine Translation Evaluation: Unveiling the Role of Dense Sentence Vector Embedding for Morphologically Rich Language

Abstract

Published Version

Talk to us

Similar Papers

More From: International Journal of Pattern Recognition and Artificial Intelligence

Lead the way for us

Journal: International Journal of Pattern Recognition and Artificial Intelligence	Publication Date: May 29, 2019
Citations: 12

Similar Papers

Statistical Analysis of Machine Translation Evaluation Systems for English- Hindi Language Pair
Pooja Malik ... Anurag S. Baghel
Recent Advances in Computer Science and Communications | VOL. 13
Pooja Malik, et. al.Pooja Malik ... Anurag S. Baghel
05 Nov 2020
Recent Advances in Computer Science and Communications | VOL. 13

Automatic Evaluation for Machine Translation
Shashi Pal Singh ... Ajai Kumar
-
Shashi Pal Singh, et. al.Shashi Pal Singh ... Ajai Kumar
01 Jan 2023
01 Jan 2023

Error Classification and Evaluation of Machine Translation Evaluation Metrics for Hindi as a Target Language
Samiksha Tripathi ... Vineet Kansal
-
Samiksha Tripathi, et. al.Samiksha Tripathi ... Vineet Kansal
01 Jan 2023
01 Jan 2023

AdaBLEU: A Modified BLEU Score for Morphologically Rich Languages
Shweta Chauhan ... Abhay Kumar
IETE Journal of Research | VOL. 69
Shweta Chauhan, et. al.Shweta Chauhan ... Abhay Kumar
24 Aug 2021
IETE Journal of Research | VOL. 69

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

Machine Translation Evaluation: Unveiling the Role of Dense Sentence Vector Embedding for Morphologically Rich Language

Abstract

Published Version

Talk to us

Similar Papers

More From: International Journal of Pattern Recognition and Artificial Intelligence