Video object detection from one single image through opto-electronic neural network

Chengyang Hu,Sigang Yang,Hongwei Chen,Minghua Chen,Honghao Huang

doi:10.1063/5.0040424

Abstract

An opto-electronic neural network is designed for video object detection from a long-exposure blurred image. This network combines an optical encoder, convolutional neural network decoder, and object detection module, which are jointly optimized end-to-end. The joint loss is adopted for updating the network according to the physical constraints of hardware via back-propagation. A high-speed refreshed spatial light modulator is used as the encoder part of the network to generate coded sub-images, and then, a single blurred image is obtained after a common camera. The rest of the network is used for video object detection. Both simulations and experiments demonstrate that our framework can successfully retrieve object labels and bounding boxes at different moments in the long exposure. To the best of our knowledge, this is the first work investigating video object detection from a single motion-degraded image.

Highlights

With the flourishing development of deep learning, a number of computer vision tasks have received much attention that teaches machines to perceive the physical world
The parameter λ in the joint loss scitation.org/journal/app is set to 1 for the Adam optimizer and set to 0 for the stochastic gradient descent (SGD)
Training is performed on a workstation with a 3.3 GHz Intel Core i9-9940X central processing unit (CPU) (32 GB RAM) and two Nvidia GeForce RTX2080Ti GPUs

Summary

Introduction

With the flourishing development of deep learning, a number of computer vision tasks have received much attention that teaches machines to perceive the physical world. Object detection has been widely used in a wide range of applications, including autonomous driving, robot vision, and video surveillance. These applications require high-quality images as input to extract precise target features. Regarding motion blur as noise and performing deblurring is a classic software solution.. All existing methods are limited to the task of generating only “one” deblurred image, which loses the information about the motion of the objects in the blurry image. The motion blur combines information about the texture and motion of the object, which can be used for video object detection (VID) in the motion process rather than just as noise

Objectives

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: APL Photonics	Publication Date: Apr 1, 2021
Citations: 16	License type: cc-by

R Discovery Prime

R Discovery Prime

Video object detection from one single image through opto-electronic neural network

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: APL Photonics

Lead the way for us

Similar Papers

Visual Feature Learning on Video Object and Human Action Detection: A Systematic Review.
Dengshan Li ... Rujing Wang
Micromachines | VOL. 13
Dengshan Li, et. al.Dengshan Li ... Rujing Wang
31 Dec 2021
Micromachines | VOL. 13

Context Matters: Refining Object Detection in Video with Recurrent Neural Networks
Subarna Tripathi ... Serge Belongie
-
Subarna Tripathi, et. al.Subarna Tripathi ... Serge Belongie
01 Jan 2015
01 Jan 2015

Object detection methods on compressed domain videos: An overview, comparative analysis, and new directions
Donghai Zhai ... Changyou Ma
Measurement | VOL. 207
Donghai Zhai, et. al.Donghai Zhai ... Changyou Ma
21 Dec 2022
Measurement | VOL. 207

Object Detection in Videos with Tubelet Proposal Networks
Kai Kang ... Hongsheng Li
-
Kai Kang, et. al.Kai Kang ... Hongsheng Li
01 Jul 2017
01 Jul 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Video object detection from one single image through opto-electronic neural network

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: APL Photonics