Real-Time Object Detection and Recognition with Computer Vision

Vishesh Tamrakar

doi:10.22214/ijraset.2024.61921

Abstract

Abstract: Object detection, a fundamental aspect of computer vision, is essential for identifying and localizing objects within images or video frames, leveraging advancements in deep learning, particularly convolutional neural networks (CNNs), to enhance precision and speed. Its applications span diverse domains, from autonomous vehicles and surveillance systems to augmented reality and human-computer interaction. Our project focuses on engineering a real-time object detection system, integrating deep learning and computer vision methodologies. Anchored on the robust Single Shot Multibox Detector (SSD) architecture and reinforced by the efficiency and accuracy of the MobileNetV3 backbone, our system utilizes a pre-trained SSD MobileNetV3 model and comprehensive annotations from the COCO dataset to adeptly detect and recognize a wide array of objects within live video streams or archived footage. It seamlessly processes video frames from various sources, annotating detected objects in real-time to provide instant visual feedback. Offering customizable confidence thresholds and support for multiple video sources, our project showcases the transformative potential of deep learning and computer vision, advancing realtime object detection across domains like surveillance and interactive systems. By pushing the boundaries of object detection technology, our project aims to enhance safety, efficiency, and user experiences in various applications, promising to redefine the landscape of computer vision with innovation and advancement.

Full Text