Abstract
Image captioning is a complex artificial intelligence task that involves many fundamental questions of data representation, learning, and natural language processing. Moreover, most work in this domain addresses the English language, because annotated training data are far more abundant for English than for other languages. We therefore investigate methods for image captioning in German that transfer knowledge from English training data. We explore four methods for generating image captions in German: two baseline methods and two more advanced ones based on transfer learning. The baseline methods are based on a state-of-the-art model that we train on a translated version of the English MS COCO dataset and on the smaller German Multi30K dataset, respectively. Both advanced methods are pre-trained on the translated MS COCO dataset and fine-tuned for German on the Multi30K dataset. One of these methods uses an alternative attention mechanism from the literature that has shown good performance in English image captioning. We compare all methods on the German Multi30K test set using common automatic evaluation metrics. We show that our advanced method with the alternative attention mechanism sets a new baseline for German BLEU, ROUGE, CIDEr, and SPICE scores, and achieves a relative improvement of 21.2% in BLEU-4 score compared to the current state of the art in German image captioning.
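The two-stage transfer-learning scheme summarized above (pre-training on translated MS COCO captions, then fine-tuning on the smaller Multi30K data) can be illustrated with a short sketch. The snippet below is a minimal, hypothetical PyTorch-style outline under assumed names and hyperparameters; CaptioningModel, load_coco_de, and load_multi30k_de are placeholders, not the authors' actual implementation or data pipeline.

```python
# Minimal sketch of the pre-train / fine-tune scheme described in the abstract.
# CaptioningModel, load_coco_de, and load_multi30k_de are hypothetical placeholders.
import torch
from torch.utils.data import DataLoader

def train(model, loader, epochs, lr):
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)
    criterion = torch.nn.CrossEntropyLoss(ignore_index=0)  # 0 = padding token id
    model.train()
    for _ in range(epochs):
        for images, captions in loader:
            logits = model(images, captions[:, :-1])        # teacher forcing
            loss = criterion(logits.reshape(-1, logits.size(-1)),
                             captions[:, 1:].reshape(-1))
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

# Stage 1: pre-train on the translated German MS COCO captions (large, noisy).
model = CaptioningModel(vocab_size=10000)                   # hypothetical model class
train(model, DataLoader(load_coco_de(), batch_size=64), epochs=20, lr=1e-4)

# Stage 2: fine-tune on the smaller, human-annotated German Multi30K captions.
train(model, DataLoader(load_multi30k_de(), batch_size=64), epochs=10, lr=1e-5)
```

For evaluation, metrics such as BLEU-4 can be computed on the tokenized German test captions, e.g. with NLTK's corpus_bleu over the Multi30K test set.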