Abstract

Object detection algorithms play an important role in detecting people in surveillance videos. In recent years, with the rapid development of deep learning, the performance of object detection has improved by leaps and bounds, and the scheme of object detection by the YOLOV7 algorithm has also been born. Traditional object detection methods often fail to achieve a balance between speed and accuracy. To address these issues, in this research, an improved YOLOv7 algorithm performance is proposed to get the best speed-to-accuracy balance compared to state-of-the-art object detection within recorded videos using an effective compression method. This method calculates the difference between frames of video, and by using the zero difference approach by removing the duplicate frames from the recorded video and choosing only the meaningful frames based on many variables, including frame size, frame details, and the distance of the frames, influence the choice of a meaningful frame, and this will reduce the size of the video by eliminating the frames comparable to those chosen. Additionally, any other datasets or pre-trained weights have not been used; YOLOv7 has been exclusively trained on the MS COCO dataset from scratch. In order to ensure the effectiveness of this approach, numerous detection systems are used in this work. Additionally, positive performance results to reduce the processing time required for object detection have been attained.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call