Abstract

This work presents a novel pipeline for action recognition, the task of classifying the action occurring in a scene. High-speed cameras are commonly used to capture high frame-rate videos containing rich motion information; however, the resulting data volume becomes the bottleneck of the system. Building on the insight that the discrete cosine transform (DCT) of a video signal reveals its motion information remarkably well, the proposed method directly captures the DCT spectrum of a video in a single shot through optical pixel-wise encoding, instead of acquiring video frames as traditional cameras do. Because video signals are sparsely distributed in the DCT domain, a learning-based frequency selector is designed to prune the trivial frequency channels of the spectrum. An opto-electronic neural network then performs action recognition from the single coded spectrum: the optical encoder generates the DCT spectrum, while the electronic part of the network jointly optimizes the frequency selector and the classification model. Compared to conventional video-based action recognition methods, the proposed method achieves higher accuracy with less data, lower communication bandwidth, and a lighter computational burden. Both simulations and experiments demonstrate its superior action recognition performance. To the best of our knowledge, this is the first work to investigate action recognition in the DCT domain.
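To make the two core ideas concrete — a pixel-wise temporal DCT of the video and pruning of low-energy frequency channels — the following is a minimal numerical sketch. It is not the paper's optical encoder or learned selector: the DCT here is computed digitally along the time axis, and the selector is a simple energy-based top-k stand-in for the learning-based frequency selector described above; function names and shapes are illustrative assumptions.

```python
import numpy as np

def dct_matrix(n):
    # Orthonormal DCT-II basis matrix (n x n).
    k = np.arange(n)[:, None]
    t = np.arange(n)[None, :]
    C = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * t + 1) * k / (2 * n))
    C[0, :] = np.sqrt(1.0 / n)
    return C

def temporal_dct_spectrum(video):
    # video: array of shape (T, H, W); apply the DCT along the time
    # axis independently at every pixel (a digital stand-in for the
    # single-shot optical pixel-wise encoding).
    T = video.shape[0]
    C = dct_matrix(T)
    return np.tensordot(C, video, axes=(1, 0))  # shape (T, H, W)

def select_top_frequencies(spectrum, k):
    # Hand-crafted stand-in for the learned frequency selector:
    # keep the k temporal-frequency channels with the highest energy.
    energy = (spectrum ** 2).sum(axis=(1, 2))
    idx = np.sort(np.argsort(energy)[::-1][:k])
    return idx, spectrum[idx]
```

For a static scene, all the temporal energy concentrates in the DC channel (index 0), so the selector discards nearly everything; motion spreads energy into higher frequency channels, which is the sparsity the pruning exploits.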
