Abstract

Image captioning models can successfully describe the visual content of images in natural language. However, generating more natural and diverse descriptions requires the model to learn style-specific patterns, which in turn demands collecting style-specific datasets, a time-consuming process. To address this issue, we propose a semi-supervised deep generative model, the Semi-supervised Conditional Variational Auto-Encoder (SCVAE). Our model can leverage both labelled and unlabelled data within a generative modelling framework. Extensive empirical results demonstrate that, compared with state-of-the-art models, the proposed method generates more accurate image captions covering a wider range of styles.
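To illustrate the conditional variational auto-encoder building block that SCVAE extends, the following is a minimal sketch of a caption CVAE objective (ELBO), assuming precomputed image features as the condition. All module names, dimensions, and the training loss shown are illustrative assumptions, not details taken from the paper.

```python
# Minimal, hypothetical sketch of a conditional VAE for caption generation.
# The condition is a precomputed image feature vector; the latent variable z
# is intended to capture stylistic variation in the caption.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ConditionalVAE(nn.Module):
    def __init__(self, feat_dim=2048, emb_dim=256, latent_dim=64, vocab_size=10000):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.encoder_rnn = nn.GRU(emb_dim, emb_dim, batch_first=True)
        # Posterior q(z | caption, image) from the caption encoding and image features
        self.to_mu = nn.Linear(emb_dim + feat_dim, latent_dim)
        self.to_logvar = nn.Linear(emb_dim + feat_dim, latent_dim)
        # Decoder p(caption | z, image): latent and image features initialize the RNN state
        self.init_state = nn.Linear(latent_dim + feat_dim, emb_dim)
        self.decoder_rnn = nn.GRU(emb_dim, emb_dim, batch_first=True)
        self.out = nn.Linear(emb_dim, vocab_size)

    def forward(self, captions, img_feats):
        emb = self.embed(captions)                       # (B, T, emb_dim)
        _, h = self.encoder_rnn(emb)                     # h: (1, B, emb_dim)
        enc = torch.cat([h.squeeze(0), img_feats], -1)
        mu, logvar = self.to_mu(enc), self.to_logvar(enc)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterization trick
        h0 = torch.tanh(self.init_state(torch.cat([z, img_feats], -1))).unsqueeze(0)
        dec_out, _ = self.decoder_rnn(emb, h0)           # teacher forcing on the same caption
        return self.out(dec_out), mu, logvar             # logits: (B, T, vocab)

def elbo_loss(logits, captions, mu, logvar):
    # Reconstruction term: predict token t+1 from the prefix up to t
    rec = F.cross_entropy(logits[:, :-1].reshape(-1, logits.size(-1)),
                          captions[:, 1:].reshape(-1))
    # KL term pulling q(z | caption, image) toward a standard normal prior
    kl = -0.5 * torch.mean(1 + logvar - mu.pow(2) - logvar.exp())
    return rec + kl
```

In a semi-supervised setting such as the one the abstract describes, this supervised ELBO would be combined with an additional objective over unlabelled captions; the exact formulation used by SCVAE is not specified here.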
