Abstract

Recent advances in deep learning have enabled machines to see, hear, and even speak; in some cases, machines have even outperformed humans at these complex tasks. Such improvements have reignited interest in many fields. Image captioning, which lies at the intersection of computer vision and natural language processing, has recently received significant attention. Deep learning-based image captioning models represent a great improvement over traditional methods. However, most work in image captioning is based on supervised deep learning methods. Recently, unsupervised image captioning has started to gain momentum. This paper presents the first survey that focuses on unsupervised and semi-supervised image captioning techniques and methods. Additionally, the survey shows how such methods can be used under different data availability and data pairing settings, where some methods require paired data while others can operate on unpaired data. Furthermore, special cases of unpaired data, such as cross-domain and cross-lingual image captioning, are also discussed. Finally, the survey presents a discussion of the challenges and future research directions of image captioning.
