A complete human verified Turkish caption dataset for MS COCO and performance evaluation with well-known image caption models trained against it

Sina Berk Golech, Elena Battini Sonmez, Hakan Ayral, Saltuk Bugra Karacan

doi:10.1109/iceccme55909.2022.9988025

Abstract

Image captioning is the task of generating natural language captions for an image. Automatic image captioning is particularly challenging because it sits at the junction of Computer Vision and Natural Language Processing. Its applications include text-based image retrieval, assistance for visually impaired users, and human-robot interaction. Most publications on the subject focus on English, an analytic language whose characteristics differ from those of agglutinative Turkish. This work introduces the Turkish MS COCO dataset, which extends the original MS COCO collection with human-verified captions in Turkish; models trained on it surpass the current state of the art in Turkish image captioning. The new dataset can also be used for machine translation research. On the Turkish MS COCO dataset, the best performance was achieved by the Meshed-Memory Transformer, with a BLEU-1 score of 0.72. The dataset is publicly available at https://github.com/BilgiAILAB/TurkishImageCaptioning. We hope that the Turkish MS COCO dataset, together with the proposed benchmark, will be a useful resource for future studies on Turkish image captioning.
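To make the reported metric concrete, the following is a minimal sketch of sentence-level BLEU-1: clipped unigram precision multiplied by a brevity penalty. It is an illustration only, not the authors' evaluation code. The abstract does not say which toolkit produced the 0.72 score, and reported scores are normally computed at corpus level. The function name `bleu1`, the whitespace tokenization, and the Turkish example captions are all assumptions made for this sketch.

```python
import math
from collections import Counter


def bleu1(candidate, references):
    """Sentence-level BLEU-1 (illustrative sketch, whitespace tokenization)."""
    cand = candidate.lower().split()
    refs = [r.lower().split() for r in references]
    if not cand or not refs:
        return 0.0

    cand_counts = Counter(cand)

    # For each token, the largest count observed in any single reference.
    max_ref_counts = Counter()
    for ref in refs:
        for tok, n in Counter(ref).items():
            max_ref_counts[tok] = max(max_ref_counts[tok], n)

    # Clip each candidate token count by its maximum reference count.
    clipped = sum(min(n, max_ref_counts[tok]) for tok, n in cand_counts.items())
    precision = clipped / len(cand)

    # Brevity penalty: use the reference length closest to the candidate
    # length, preferring the shorter reference on ties.
    ref_len = min((len(r) for r in refs), key=lambda n: (abs(n - len(cand)), n))
    bp = 1.0 if len(cand) >= ref_len else math.exp(1 - ref_len / len(cand))
    return bp * precision


# Hypothetical captions, not taken from the dataset.
refs = ["bir kedi kanepede uyuyor", "kanepede uyuyan bir kedi"]
print(bleu1("bir kedi kanepede uyuyor", refs))
print(bleu1("bir bir bir", ["bir kedi"]))
```

Because Turkish is agglutinative, a single suffix difference makes two surface forms count as different unigrams under whitespace tokenization, so word-level BLEU scores are not directly comparable with scores for English captions.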

