Abstract

Caption generation for images and videos has recently attracted considerable interest. However, it remains challenging for models to select the proper subjects against a complex background and to generate the desired captions in high-level vision tasks. Inspired by recent work, we propose a novel image captioning model based on high-level image features. We combine low-level information, such as image quality, with high-level features, such as motion classification and face recognition, to detect the attention regions of an image. We demonstrate that our attention model produces good performance in experiments on the MSCOCO, Flickr30K, PASCAL, and SBU datasets.
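To make the fusion idea concrete, the following is a minimal sketch (not the authors' code) of how low-level cues (e.g. a per-region quality score) and high-level cues (e.g. motion or face detector responses) could be combined into spatial attention weights used to pool image features before a caption decoder. All module names, dimensions, and the fusion rule are assumptions for illustration.

```python
import torch
import torch.nn as nn


class FusedAttentionPool(nn.Module):
    """Scores image regions from CNN features plus auxiliary cues (assumed design)."""

    def __init__(self, feat_dim: int, cue_dim: int, hidden_dim: int = 128):
        super().__init__()
        self.score = nn.Sequential(
            nn.Linear(feat_dim + cue_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, 1),
        )

    def forward(self, feats: torch.Tensor, cues: torch.Tensor):
        # feats: (batch, regions, feat_dim) region-level CNN features
        # cues:  (batch, regions, cue_dim) stacked low/high-level cues,
        #        e.g. [quality, motion score, face score] per region
        logits = self.score(torch.cat([feats, cues], dim=-1)).squeeze(-1)
        attn = torch.softmax(logits, dim=-1)            # (batch, regions)
        pooled = (attn.unsqueeze(-1) * feats).sum(dim=1)  # (batch, feat_dim)
        return pooled, attn


if __name__ == "__main__":
    batch, regions, feat_dim, cue_dim = 2, 49, 512, 3
    feats = torch.randn(batch, regions, feat_dim)
    cues = torch.rand(batch, regions, cue_dim)
    pooled, attn = FusedAttentionPool(feat_dim, cue_dim)(feats, cues)
    print(pooled.shape, attn.shape)  # torch.Size([2, 512]) torch.Size([2, 49])
```

The pooled vector could then be fed to any standard caption decoder (e.g. an LSTM); the specific decoder is not specified by the abstract.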
