A Systematic Literature Review on Using the Encoder-Decoder Models for Image Captioning in English and Arabic Languages

Ashwaq Alsayed, Saud Alotaibi, Muhammad Arif, Thamir M Qadah

doi:10.3390/app131910894

Open Access

https://doi.org/10.3390/app131910894

Journal: Applied Sciences
Publication Date: Sep 30, 2023
Citations: 2
License type: CC BY 4.0

Affiliation: Umm al-Qura University

Abstract

With the explosion of visual content on the Internet, generating captions for images has become a necessary task and an active research topic, and its importance continues to grow as more people use social media platforms. While research on English image captioning (EIC) is extensive, studies on image captioning in other languages, especially Arabic, are limited, and no systematic survey of Arabic image captioning (AIC) has yet been attempted. This research systematically surveys encoder-decoder EIC with respect to the following aspects: visual model, language model, loss functions, datasets, evaluation metrics, model comparison, and adaptability to the Arabic language. We review the literature on EIC and AIC approaches published over the past nine years (2015–2023) and indexed in well-known databases (Google Scholar, ScienceDirect, IEEE Xplore). We identified 52 primary studies relevant to our objectives: 11 address Arabic captioning and the remaining 41 address English. The review shows that English-specific models can be applied to Arabic, provided that a high-quality Arabic database is used and appropriate preprocessing is applied. We also discuss several limitations and propose ways to address them as future directions.
