Investigating backtranslation for the improvement of English-Irish machine translation

Meghan Dowling,Teresa Lynn,Andy Way

doi:10.35903/teanga.v26i0.88

Investigating backtranslation for the improvement of English-Irish machine translation

Meghan Dowling, Teresa Lynn + Show 1 more

Open Access

https://doi.org/10.35903/teanga.v26i0.88

Copy DOI

Journal: TEANGA, the Journal of the Irish Association for Applied Linguistics	Publication Date: Nov 29, 2019
Citations: 3	License type: CC BY-NC-ND 4.0

Affiliation: Dublin City University

#Machine Translation #Domain Specific Data + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

In this paper, we discuss the difficulties of building reliable machine translation systems for the English-Irish (EN-GA) language pair. In the context of limited datasets, we report on assessing the use of backtranslation as a method for creating artificial EN-GA data to increase training data for use state-of-the-art data-driven translation systems. We compare our results to earlier work on EN-GA machine translation by Dowling et al (2016, 2017, 2018) showing that while our own systems do not compare in quality with respect to traditionally reported BLEU metrics, we provide a linguistic analysis to suggest that future work with domain specific data may prove more successful.

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Similar Papers

Paper Title

Journal

Date

Author

View more papers

More From: TEANGA, the Journal of the Irish Association for Applied Linguistics

Paper Title

Journal

Date

Author

View more papers

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.