Language Resource Efficient Learning for Captioning

Yike Wu

doi:10.48448/etv2-s970

Abstract

Due to complex cognitive and inferential efforts involved in the manual generation of one caption per image/video input, the human annotation resources are very limited for captioning tasks. We define language resource efficient as reaching the same performance with fewer annotated captions per input. We first study the performance degradation of caption models in different language resource settings. Our analysis of caption models with SC loss shows that the performance degradation is caused by the increasingly noisy estimation of reward and baseline with fewer language resources. To mitigate this issue, we propose to reduce the variance of noise in the baseline by generalizing the single pairwise comparison in SC loss and using multiple generalized pairwise comparisons. The generalized pairwise comparison (GPC) measures the difference between the evaluation scores of two captions with respect to an input. Empirically, we show that the model trained with the proposed GPC loss is efficient on language resource and achieves similar performance with the state-of-the-art models on MSCOCO by using only half of the language resources. Furthermore, our model significantly outperforms the state-of-the-art models on a video caption dataset that has only one labeled caption per input in the training set.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Language Resource Efficient Learning for Captioning

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Positive evaluation in the language of human resources: textbooks versus reality
Pavel Reich
Language Learning in Higher Education | VOL. 11
Pavel ReichPavel Reich
26 May 2021
Language Learning in Higher Education | VOL. 11

Nenek: a cloud-based collaboration platform for the management of Amerindian language resources
J L Gonzalez ... Jesus Carretero
Language Resources and Evaluation | VOL. 51
J L Gonzalez, et. al.J L Gonzalez ... Jesus Carretero
16 Jun 2016
Language Resources and Evaluation | VOL. 51

Zero-Shot Cross-Lingual Transfer in Legal Domain Using Transformer Models
Zein Shaheen ... Dmitry Mouromtsev
-
Zein Shaheen, et. al.Zein Shaheen ... Dmitry Mouromtsev
01 Dec 2021
01 Dec 2021

Developing Multilingual Capacity: A Challenge for the Multinational Enterprise
Denice E Welch ... Lawrence S Welch
Journal of Management | VOL. 44
Denice E Welch, et. al.Denice E Welch ... Lawrence S Welch
08 Jul 2015
Journal of Management | VOL. 44

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Language Resource Efficient Learning for Captioning

Abstract

Talk to us

Similar Papers