Abstract

Improving object detection models in computer vision is a crucial area of focus. Object detection represents a more sophisticated approach to image classification, where a neural network identifies objects within an image and delineates them using bounding boxes. YOLO (You Only Look Once) revolutionized object detection by introducing an end-to-end neural network architecture that simultaneously predicts bounding boxes and class probabilities. Unlike earlier methods that adapted classifiers for detection, YOLO's approach streamlines the process by integrating classification and localization tasks, accurately delineating objects with bounding boxes. The project mainly focuses on augmenting YOLOv4, a state-of-the-art object detection architecture, to address the limitations like small object detection, adaptive lighting handling, occlusion challenges, detecting objects at different angles, and specialized object features. Additionally, we aim to improve the model's ability to recognize specialized object features for superior classification. Furthermore, by incorporating functionalities like license plate recognition using Tesseract OCR, object counting (total and class-wise), and detection cropping, this project extends the model's applicability to real-world scenarios. These functionalities make the model valuable in areas like autonomous vehicles, traffic monitoring, and industrial automation. The augmented model's improvements aim to overcome drawbacks identified in existing research, enhancing its efficacy in various scenarios.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call