Why Is the Video Analytics Accuracy Fluctuating, and What Can We Do About It?

Sibendu Paul,Murugan Sankaradas,Oliver Po,Kunal Rao,Y Charlie Hu,Giuseppe Coviello,Srimat Chakradhar

doi:10.1007/978-3-031-25056-9_28

Abstract

AbstractIt is a common practice to think of a video as a sequence of images (frames), and re-use deep neural network models that are trained only on images for similar analytics tasks on videos. In this paper, we show that this “leap of faith” that deep learning models that work well on images will also work well on videos is actually flawed. We show that even when a video camera is viewing a scene that is not changing in any human-perceptible way, and we control for external factors like video compression and environment (lighting), the accuracy of video analytics application fluctuates noticeably. These fluctuations occur because successive frames produced by the video camera may look similar visually, but are perceived quite differently by the video analytics applications. We observed that the root cause for these fluctuations is the dynamic camera parameter changes that a video camera automatically makes in order to capture and produce a visually pleasing video. The camera inadvertently acts as an “unintentional adversary” because these slight changes in the image pixel values in consecutive frames, as we show, have a noticeably adverse impact on the accuracy of insights from video analytics tasks that re-use image-trained deep learning models. To address this inadvertent adversarial effect from the camera, we explore the use of transfer learning techniques to improve learning in video analytics tasks through the transfer of knowledge from learning on image analytics tasks. Our experiments with a number of different cameras, and a variety of different video analytics tasks, show that the inadvertent adversarial effect from the camera can be noticeably offset by quickly re-training the deep learning models using transfer learning. In particular, we show that our newly trained Yolov5 model reduces fluctuation in object detection across frames, which leads to better tracking of objects (\(\sim \)40% fewer mistakes in tracking). Our paper also provides new directions and techniques to mitigate the camera’s adversarial effect on deep learning models used for video analytics applications.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Why Is the Video Analytics Accuracy Fluctuating, and What Can We Do About It?

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Edge-based Video Analytic for Smart Cities
Dipak Pudasaini ... Abdolreza Abhari
International Journal of Advanced Computer Science and Applications | VOL. 12
Dipak Pudasaini, et. al.Dipak Pudasaini ... Abdolreza Abhari
01 Jan 2020
International Journal of Advanced Computer Science and Applications | VOL. 12

Applications of Intelligent Video Analytics in the Field of Retail Management
Harkirat Singh
-
Harkirat SinghHarkirat Singh
01 Jan 2018
01 Jan 2018

APT: Adaptive Perceptual quality based camera Tuning using reinforcement learning
Sibendu Paul ... Murugan Sankaradas
-
Sibendu Paul, et. al.Sibendu Paul ... Murugan Sankaradas
29 Nov 2022
29 Nov 2022

Deep Learning Improves Speed and Accuracy of Prostate Gland Segmentations on Magnetic Resonance Imaging for Targeted Biopsy.
Simon John Christoph Soerensen ... Geoffrey A Sonn
Journal of Urology | VOL. 206
Simon John Christoph Soerensen, et. al.Simon John Christoph Soerensen ... Geoffrey A Sonn
21 Apr 2021
Journal of Urology | VOL. 206

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Why Is the Video Analytics Accuracy Fluctuating, and What Can We Do About It?

Abstract

Talk to us

Similar Papers