Deep Learning in Natural Language Generation from Images

Xiaodong He,Li Deng

doi:10.1007/978-981-10-5209-5_10

Abstract

Natural language generation from images, referred to as image or visual captioning also, is an emerging deep learning application that is in the intersection between computer vision and natural language processing. Image captioning also forms the technical foundation for many practical applications. The advances in deep learning technologies have created significant progress in this area in recent years. In this chapter, we review the key developments in image captioning and their impact in both research and industry deployment. Two major schemes developed for image captioning, both based on deep learning, are presented in detail. A number of examples of natural language descriptions of images produced by two state-of-the-art captioning systems are provided to illustrate the high quality of the systems’ outputs. Finally, recent research on generating stylistic natural language from images is reviewed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Deep Learning in Natural Language Generation from Images

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

An Analysis on Recent Approaches for Image Captioning
Qazi Anwar ... Ch V S Satyamurty
CVR Journal of Science and Technology | VOL. 26
Qazi Anwar, et. al.Qazi Anwar ... Ch V S Satyamurty
01 Jun 2024
CVR Journal of Science and Technology | VOL. 26

Image Captioning Based on Semantic Scenes.
Fengzhi Zhao ... Yi Lv
Entropy (Basel, Switzerland) | VOL. 26
Fengzhi Zhao, et. al.Fengzhi Zhao ... Yi Lv
18 Oct 2024
Entropy (Basel, Switzerland) | VOL. 26

Deep Image Captioning Survey: A Resource Availability Perspective
Mousa Al Sulaimi ... Imtiaz Ahmad
-
Mousa Al Sulaimi, et. al.Mousa Al Sulaimi ... Imtiaz Ahmad
12 May 2021
12 May 2021

Visual Image Captioning through Transformer
Muneeb Nabi ... Apurva Jain
International Journal for Research in Applied Science and Engineering Technology | VOL. 11
Muneeb Nabi, et. al.Muneeb Nabi ... Apurva Jain
31 Dec 2024
International Journal for Research in Applied Science and Engineering Technology | VOL. 11

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Deep Learning in Natural Language Generation from Images

Abstract

Talk to us

Similar Papers