When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in Vlogs

Oana Ignat,Rada Mihalcea,Jiajun Bao,Santiago Castro,Dandan Shan,Yuhang Zhou

doi:10.1145/3495211

When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in Vlogs

Oana Ignat, Rada Mihalcea + Show 4 more

Open Access

https://doi.org/10.1145/3495211

Copy DOI

Journal: ACM Transactions on Multimedia Computing, Communications, and Applications	Publication Date: Oct 31, 2022
Citations: 1

Affiliation: University of Michigan–Ann Arbor

#Temporal Localization #Task Of Localization + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

We consider the task of temporal human action localization in lifestyle vlogs. We introduce a novel dataset consisting of manual annotations of temporal localization for 13,000 narrated actions in 1,200 video clips. We present an extensive analysis of this data, which allows us to better understand how the language and visual modalities interact throughout the videos. We propose a simple yet effective method to localize the narrated actions based on their expected duration. Through several experiments and analyses, we show that our method brings complementary information with respect to previous methods, and leads to improvements over previous work for the task of temporal action localization.

Full Text