Chinese image caption of Inceptionv4 and double-layer GRUs based on attention mechanism

Yongbin Pan,Xiuling Gan,Shukai Duan,Lidan Wang,Liangyi Hong

doi:10.1088/1742-6596/1861/1/012044

Abstract

In recent years, there has been a wave of research on English image caption at home and abroad. However, due to the particularity of Chinese image caption task, the research on Chinese image caption has not made good progress. In order to solve this problem, a new Chinese image caption model is implemented. Firstly, the AI challenge dataset is enhanced, and then the Chinese text data of the dataset is preprocessed by Chinese word segmentation tool word2vec. Secondly, based on the encoder-decoder framework, the image visual features are extracted by Inceptionv4 network, the attention mechanism is incorporated in the process of feature extraction and the Chinese sentences are generated by double-layer GRUs network. In the process of training, Adam is used to optimize the algorithm. Finally, A GUI interface is designed to better show the experimental effect. Experiments show that the new Chinese image caption model can automatically generate more fluent and more accurate Chinese caption sentences, and the trained model has excellent performance in many evaluation indexes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Journal of Physics: Conference Series	Publication Date: Mar 1, 2021
Citations: 3	License type: cc-by

R Discovery Prime

R Discovery Prime

Chinese image caption of Inceptionv4 and double-layer GRUs based on attention mechanism

Abstract

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series

Lead the way for us

Similar Papers

Visuals to Text: A Comprehensive Review on Automatic Image Captioning
Yue Ming ... Nannan Hu
IEEE/CAA Journal of Automatica Sinica | VOL. 9
Yue Ming, et. al.Yue Ming ... Nannan Hu
01 Aug 2022
IEEE/CAA Journal of Automatica Sinica | VOL. 9

Global Visual Feature and Linguistic State Guided Attention for Remote Sensing Image Captioning
Zhengyuan Zhang ... Wenkai Zhang
IEEE Transactions on Geoscience and Remote Sensing | VOL. 60
Zhengyuan Zhang, et. al.Zhengyuan Zhang ... Wenkai Zhang
01 Jan 2021
IEEE Transactions on Geoscience and Remote Sensing | VOL. 60

Sequence Generation with Target Attention
Yingce Xia ... Tie-Yan Liu
-
Yingce Xia, et. al.Yingce Xia ... Tie-Yan Liu
01 Jan 2017
01 Jan 2017

Chinese Image Caption Generation via Visual Attention and Topic Modeling.
Maofu Liu ... Lingjun Li
IEEE Transactions on Cybernetics | VOL. 52
Maofu Liu, et. al.Maofu Liu ... Lingjun Li
22 Jun 2020
IEEE Transactions on Cybernetics | VOL. 52

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Chinese image caption of Inceptionv4 and double-layer GRUs based on attention mechanism

Abstract

Talk to us

Similar Papers

More From: Journal of Physics: Conference Series