Abstract

Video-based action recognition, which must handle temporal motion and spatial cues simultaneously, remains a challenging task. In this paper, our motivation is to address this issue by fully exploiting temporal information. Specifically, we propose a novel lightweight Voting-based Temporal Correlation (VTC) module to enhance temporal information. The module contains multiple branches with different temporal sampling intervals, which are regarded as voters: the final classification result is “voted” on by these branches together. The VTC module integrates a sparse temporal sampling strategy into feature sequences, mitigating the effect of redundant information and focusing more on temporal modeling. Additionally, we propose a simple and intuitive Similarity Loss (SL) to guide the training of the VTC module and the backbone network. By intentionally introducing confusion into the predicted vector, SL eases intra-class variation, encouraging the network to discover class-specific common motion patterns rather than sample-specific discriminative information. SL neither requires excessive parameter tuning during training nor adds significant computational overhead at test time. By combining the VTC module and SL with complementary advances in the field, we clearly outperform state-of-the-art results, achieving 83.0%, 98.4%, 49.6%, and 77.8% accuracy on HMDB51, UCF101, Something-Something-V1, and Kinetics, respectively.
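The voting idea described above can be illustrated with a minimal NumPy sketch. This is a hypothetical toy version, not the paper's implementation: each "branch" sub-samples a per-frame feature sequence with a different temporal interval, produces class scores through a shared linear classifier (a stand-in for the real branch networks), and the branch scores are averaged into the final "voted" prediction. All names and shapes here are illustrative assumptions.

```python
import numpy as np

def vtc_vote(features, weights, intervals=(1, 2, 4)):
    """Toy sketch of voting over temporal-sampling branches.

    features:  (T, D) per-frame feature sequence
    weights:   (D, C) shared linear classifier (illustrative stand-in)
    intervals: temporal sampling strides, one per branch ("voter")
    """
    votes = []
    for s in intervals:
        sampled = features[::s]        # sparse temporal sampling at stride s
        pooled = sampled.mean(axis=0)  # temporal average pooling
        votes.append(pooled @ weights) # this branch's class scores
    # the final prediction is "voted" by averaging the branch scores
    return np.mean(votes, axis=0)

rng = np.random.default_rng(0)
feats = rng.standard_normal((16, 8))   # 16 frames, 8-dim features
W = rng.standard_normal((8, 5))        # 5 action classes
scores = vtc_vote(feats, W)
print(scores.shape)
```

Because each branch sees the sequence at a different stride, short- and long-range temporal structure both contribute to the averaged score, which is the intuition behind treating the branches as voters.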
