Abstract

Neural-network-based image and video captioning can be substantially improved by architectures that exploit specialized features derived from the scene context, objects, and locations. Accuracy is further improved by a novel discriminatively trained evaluator network that selects the best caption from among those produced by an ensemble of caption generator networks.
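To make the selection step concrete, the sketch below shows one way an evaluator network could score candidate captions from an ensemble and pick the highest-scoring one. This is a minimal illustrative example, not the authors' implementation: the module names, feature dimensions, and scoring scheme are all assumptions.

```python
# Minimal sketch (assumed, not the paper's code): a discriminatively trained
# evaluator scores (image, caption) pairs and the best ensemble caption wins.
import torch
import torch.nn as nn

class CaptionEvaluator(nn.Module):
    """Scores how well a candidate caption matches the image features."""
    def __init__(self, image_dim=2048, caption_dim=512, hidden_dim=256):
        super().__init__()
        self.scorer = nn.Sequential(
            nn.Linear(image_dim + caption_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 1),  # scalar compatibility score
        )

    def forward(self, image_feat, caption_feat):
        # image_feat: (B, image_dim), caption_feat: (B, caption_dim)
        joint = torch.cat([image_feat, caption_feat], dim=-1)
        return self.scorer(joint).squeeze(-1)

def select_best_caption(evaluator, image_feat, candidate_feats, candidate_texts):
    """Pick the ensemble caption the evaluator scores highest for one image."""
    with torch.no_grad():
        scores = evaluator(image_feat.expand(len(candidate_texts), -1), candidate_feats)
    return candidate_texts[int(scores.argmax())]

# Toy usage with random tensors standing in for real encoder outputs.
evaluator = CaptionEvaluator()
image_feat = torch.randn(1, 2048)
candidates = ["a dog runs on the beach", "a person walks a dog", "a dog plays in sand"]
candidate_feats = torch.randn(len(candidates), 512)
print(select_best_caption(evaluator, image_feat, candidate_feats, candidates))
```

In practice the evaluator would be trained discriminatively, e.g. to rank ground-truth captions above generated or mismatched ones, so that its scores reflect caption quality rather than the random initialization used in this toy example.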
