Abstract

Many recent methods adopt the encoder-decoder framework for video captioning, aiming to translate short videos into natural language. These methods usually sample frames at equal intervals. However, such sampling is inefficient: it carries high temporal and spatial redundancy and therefore incurs unnecessary computation. In addition, existing approaches simply concatenate different visual features at the fully connected layer, so the features cannot be used effectively. To address these shortcomings, we propose a filtration network (FN) to select key frames, trained with a deep reinforcement learning algorithm we call actor-double-critic. Inspired by behavioral psychology, the core idea of actor-double-critic is that an agent's behavior is determined by both the external environment and its internal personality. It avoids unclear rewards and sparse feedback during training because it provides steady feedback after every action. The selected key frames are fed into a combine codec network (CCN) to generate sentences. The feature-combination operation in CCN fuses visual features through a complex-number representation for better semantic modeling. Experiments and comparisons with other methods on two datasets (MSVD and MSR-VTT) show that our approach achieves better performance on four metrics: BLEU-4, METEOR, ROUGE-L, and CIDEr.
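
The abstract does not detail how the complex-number fusion in CCN is realized. Below is a minimal sketch of one common way such a fusion could be implemented, assuming (purely for illustration) that appearance features form the real part and motion features the imaginary part of a complex vector, mixed by a complex-valued linear layer; the class and variable names are hypothetical and not taken from the paper.

```python
import torch
import torch.nn as nn

class ComplexFeatureFusion(nn.Module):
    """Illustrative sketch: fuse two visual feature streams as one complex vector.

    Assumption (not stated in the abstract): appearance features act as the real
    part and motion features as the imaginary part; a complex-valued linear map
    mixes the two before the result is flattened back to a real-valued vector.
    """

    def __init__(self, dim_in: int, dim_out: int):
        super().__init__()
        # A complex weight W = W_r + i*W_i expands into four real matrix products.
        self.w_real = nn.Linear(dim_in, dim_out, bias=False)
        self.w_imag = nn.Linear(dim_in, dim_out, bias=False)

    def forward(self, appearance: torch.Tensor, motion: torch.Tensor) -> torch.Tensor:
        # (a + i*b)(W_r + i*W_i): real part a*W_r - b*W_i, imaginary part a*W_i + b*W_r
        real = self.w_real(appearance) - self.w_imag(motion)
        imag = self.w_imag(appearance) + self.w_real(motion)
        # Concatenate real and imaginary parts as the fused representation.
        return torch.cat([real, imag], dim=-1)

# Usage with hypothetical 2048-d features extracted from the selected key frames.
fusion = ComplexFeatureFusion(dim_in=2048, dim_out=512)
appearance_feats = torch.randn(8, 2048)  # e.g. 2D-CNN appearance features per key frame
motion_feats = torch.randn(8, 2048)      # e.g. 3D-CNN motion features per key frame
fused = fusion(appearance_feats, motion_feats)  # shape: (8, 1024)
```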
