Abstract
The unprecedented success of deep convolutional neural networks (CNNs) on video-based human action recognition assumes the availability of good-resolution videos and the resources to develop and deploy complex models. Unfortunately, budgetary and environmental constraints on the camera system and the recognition model may not accommodate these assumptions and may require reducing their complexity. To alleviate these issues, we introduce a deep sensing solution that recognizes human actions directly from coded exposure images. Our deep sensing solution consists of a binary CNN-based encoder network that emulates the capture of a coded exposure image of a dynamic scene by a coded exposure camera, followed by a 2D CNN that recognizes human action in the captured coded exposure image. Furthermore, we propose a novel knowledge distillation framework to jointly train the encoder and the action recognition model, and show that the proposed training approach improves action recognition accuracy by absolute margins of 6.2%, 2.9%, and 7.9% on the Something-Something-v2, Kinetics-400, and UCF-101 datasets, respectively, compared to our previous approach. Finally, we built a prototype coded exposure camera using a liquid crystal on silicon (LCoS) device to validate the feasibility of our deep sensing solution. Our evaluation of the prototype camera shows results that are consistent with the simulation results.
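To make the idea of coded exposure concrete: a coded exposure camera integrates light over a clip while a per-pixel binary shutter code selects which frames each pixel observes, collapsing the clip into a single 2D image. The sketch below is a minimal simulation of that image-formation model in NumPy; the function name `coded_exposure` and the normalization by the per-pixel open-shutter count are illustrative assumptions, not the paper's learned binary encoder.

```python
import numpy as np

def coded_exposure(frames, code):
    """Simulate a coded exposure capture of a short clip.

    frames: (T, H, W) array of grayscale video frames in [0, 1].
    code:   (T, H, W) binary array; code[t, y, x] = 1 means pixel (y, x)
            is exposed during frame t.
    Returns a single (H, W) coded image.

    Hypothetical helper for illustration only -- in the paper the binary
    code is produced by a learned CNN encoder, not sampled at random.
    """
    assert frames.shape == code.shape
    # Each pixel integrates only the frames where its shutter code is open.
    coded = (frames * code).sum(axis=0)
    # Normalize by the per-pixel open-shutter count so intensities stay
    # comparable across pixels with different exposure durations.
    counts = np.maximum(code.sum(axis=0), 1)
    return coded / counts

# Example: an 8-frame clip with a random binary exposure code.
rng = np.random.default_rng(0)
frames = rng.random((8, 16, 16))
code = rng.integers(0, 2, size=(8, 16, 16))
img = coded_exposure(frames, code)  # (16, 16) coded exposure image
```

The downstream 2D CNN in the paper then classifies the action from `img` alone, which is why the joint training of the code and the classifier matters: the code must preserve motion cues in a single frame-sized image.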
Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence