Image captioning based on deep reinforcement learning

Haichao Shi,Bo Wang,Zhenyu Wang,Peng Li

doi:10.1145/3240876.3240900

Abstract

Recently it has shown that the policy-gradient methods for reinforcement learning have been utilized to train deep end-to-end systems on natural language processing tasks. What's more, with the complexity of understanding image content and diverse ways of describing image content in natural language, image captioning has been a challenging problem to deal with. To the best of our knowledge, most state-of-the-art methods follow a pattern of sequential model, such as recurrent neural networks (RNN). However, in this paper, we propose a novel architecture for image captioning with deep reinforcement learning to optimize image captioning tasks. We utilize two networks called "policy network" and "value network" to collaboratively generate the captions of images. The experiments are conducted on Microsoft COCO dataset, and the experimental results have verified the effectiveness of the proposed method.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Image captioning based on deep reinforcement learning

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Improving Remote Sensing Image Captioning by Combining Grid Features and Transformer
Shuo Zhuang ... Feng Gao
IEEE Geoscience and Remote Sensing Letters | VOL. 19
Shuo Zhuang, et. al.Shuo Zhuang ... Feng Gao
01 Jan 2021
IEEE Geoscience and Remote Sensing Letters | VOL. 19

Visual saliency for image captioning in new multimedia services
Marcella Cornia ... Rita Cucchiara
-
Marcella Cornia, et. al.Marcella Cornia ... Rita Cucchiara
01 Jul 2017
01 Jul 2017

Multi-Level Policy and Reward-Based Deep Reinforcement Learning Framework for Image Captioning
Ning Xu ... Hanwang Zhang
IEEE Transactions on Multimedia | VOL. 22
Ning Xu, et. al.Ning Xu ... Hanwang Zhang
26 Sep 2019
IEEE Transactions on Multimedia | VOL. 22

Sample effficient deep reinforcement learning for control

-

15 Dec 2019
15 Dec 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Image captioning based on deep reinforcement learning

Abstract

Talk to us

Similar Papers