LGST-Drop: label-guided structural dropout for spatial–temporal convolutional neural networks

Hu Cui, Chuhua Huang, Renjing Huang, Ruoyu Zhang

doi:10.1117/1.jei.31.3.033036

Abstract

Region dropout regularization strategies have proven highly effective at improving the generalization performance of convolutional neural networks (CNNs) in a variety of computer vision tasks, including image classification, object detection, and semantic segmentation, because they push models to attend to a wider range of image regions. For action recognition, however, models must extract not only useful spatial information but also important temporal and motion information, a requirement that traditional regularization strategies do not meet. We propose a spatiotemporal dropout strategy to meet the need for regularization in spatial–temporal CNNs. We call it label-guided spatial–temporal drop (LGST-Drop); it not only provides effective structured dropout in the spatial dimension but also regularizes motion information in the temporal dimension. In addition, LGST-Drop's mask is guided by the categories predicted by the model itself, which we call temporary labels. Extensive experiments on several standard action recognition datasets show the usefulness of the proposed technique in comparison with previous methods and their state-of-the-art variants.
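
The abstract does not say how the mask is built, so the following is only a minimal illustrative sketch of the general idea, not the authors' algorithm. It assumes a CAM-like saliency map computed from hypothetical classifier weights (`class_weights`) for the model's own predicted class (the "temporary label"), and it zeroes one contiguous block across time, height, and width around the most salient location. The function name, the block size, and the rescaling step are all assumptions made for illustration.

```python
import numpy as np

def label_guided_st_drop(features, class_weights, logits,
                         block=(2, 3, 3), training=True):
    """Illustrative sketch only (not the published LGST-Drop procedure).

    features:      (C, T, H, W) activations of one clip
    class_weights: (num_classes, C) hypothetical classifier weights
    logits:        (num_classes,) current model outputs for this clip
    block:         (frames, height, width) of the region to drop (assumed)
    """
    if not training:
        return features  # dropout is disabled at inference time

    _, T, H, W = features.shape
    bt, bh, bw = min(block[0], T), min(block[1], H), min(block[2], W)

    # Temporary label: the class the model currently predicts.
    temp_label = int(np.argmax(logits))

    # CAM-like saliency: channel-weighted sum, giving a (T, H, W) map.
    saliency = np.einsum("c,cthw->thw", class_weights[temp_label], features)

    # Centre the dropped block on the most salient position, clipped to bounds.
    t, h, w = np.unravel_index(np.argmax(saliency), saliency.shape)
    t0 = int(np.clip(t - bt // 2, 0, T - bt))
    h0 = int(np.clip(h - bh // 2, 0, H - bh))
    w0 = int(np.clip(w - bw // 2, 0, W - bw))

    mask = np.ones((T, H, W), dtype=features.dtype)
    mask[t0:t0 + bt, h0:h0 + bh, w0:w0 + bw] = 0  # structured spatiotemporal hole

    keep = mask.mean()
    if keep == 0:
        return features  # block covers everything; skip dropping
    # Rescale so the mean activation magnitude is roughly preserved.
    return features * mask / keep

# Small usage example with synthetic data.
features = np.ones((2, 4, 6, 6))
features[:, 2, 3, 3] = 5.0                 # one strongly activated location
class_weights = np.ones((3, 2))
logits = np.array([0.1, 0.9, 0.2])
out = label_guided_st_drop(features, class_weights, logits)
print(out.shape)
```

Because the dropped block spans several frames as well as a spatial window, the network cannot rely on a single salient region in either space or time, which is the intuition the abstract describes.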

More From: Journal of Electronic Imaging

Similar Papers

Skeleton-based action recognition based on multidimensional adaptive dynamic temporal graph convolutional network
Yu Xia ... Yi Cao
Engineering Applications of Artificial Intelligence | VOL. 127
03 Oct 2023

Spatial–temporal convolutional neural networks for anomaly detection and localization in crowded scenes
Shifu Zhou ... Zhijiang Zhang
Signal Processing: Image Communication | VOL. 47
14 Jul 2016

Object detection and activity recognition in video surveillance using neural networks
Vishva Payghode ... Ashwani Kumar Dubey
International Journal of Web Information Systems | VOL. 19
20 Apr 2023

Model distillation for high-level semantic understanding: a survey
Ruoyu Sun ... Hongkai Xiong
Journal of Image and Graphics | VOL. 28
01 Jan 2023
