Large-scale Object Detection Research Articles

PurposeThis paper aims to implement and extend the You Only Live Once (YOLO) algorithm for detection of objects and activities. The advantage of YOLO is that it only runs a neural network once to detect the objects in an image, which is why it is powerful and fast. Cameras are found at many different crossroads and locations, but video processing of the feed through an object detection algorithm allows determining and tracking what is captured. Video Surveillance has many applications such as Car Tracking and tracking of people related to crime prevention. This paper provides exhaustive comparison between the existing methods and proposed method. Proposed method is found to have highest object detection accuracy.Design/methodology/approachThe goal of this research is to develop a deep learning framework to automate the task of analyzing video footage through object detection in images. This framework processes video feed or image frames from CCTV, webcam or a DroidCam, which allows the camera in a mobile phone to be used as a webcam for a laptop. The object detection algorithm, with its model trained on a large data set of images, is able to load in each image given as an input, process the image and determine the categories of the matching objects that it finds. As a proof of concept, this research demonstrates the algorithm on images of several different objects. This research implements and extends the YOLO algorithm for detection of objects and activities. The advantage of YOLO is that it only runs a neural network once to detect the objects in an image, which is why it is powerful and fast. Cameras are found at many different crossroads and locations, but video processing of the feed through an object detection algorithm allows determining and tracking what is captured. For video surveillance of traffic cameras, this has many applications, such as car tracking and person tracking for crime prevention. In this research, the implemented algorithm with the proposed methodology is compared against several different prior existing methods in literature. The proposed method was found to have the highest object detection accuracy for object detection and activity recognition, better than other existing methods.FindingsThe results indicate that the proposed deep learning–based model can be implemented in real-time for object detection and activity recognition. The added features of car crash detection, fall detection and social distancing detection can be used to implement a real-time video surveillance system that can help save lives and protect people. Such a real-time video surveillance system could be installed at street and traffic cameras and in CCTV systems. When this system would detect a car crash or a fatal human or pedestrian fall with injury, it can be programmed to send automatic messages to the nearest local police, emergency and fire stations. When this system would detect a social distancing violation, it can be programmed to inform the local authorities or sound an alarm with a warning message to alert the public to maintain their distance and avoid spreading their aerosol particles that may cause the spread of viruses, including the COVID-19 virus.Originality/valueThis paper proposes an improved and augmented version of the YOLOv3 model that has been extended to perform activity recognition, such as car crash detection, human fall detection and social distancing detection. The proposed model is based on a deep learning convolutional neural network model used to detect objects in images. The model is trained using the widely used and publicly available Common Objects in Context data set. The proposed model, being an extension of YOLO, can be implemented for real-time object and activity recognition. The proposed model had higher accuracies for both large-scale and all-scale object detection. This proposed model also exceeded all the other previous methods that were compared in extending and augmenting the object detection to activity recognition. The proposed model resulted in the highest accuracy for car crash detection, fall detection and social distancing detection.

Read full abstract

With the rapid development of deep learning, many deep learning-based approaches have made great achievements in object detection tasks. It is generally known that deep learning is a data-driven approach. Data directly impact the performance of object detectors to some extent. Although existing datasets include common objects in remote sensing images, they still have some scale, category, and image limitations. Therefore, there is a strong requirement for establishing a large-scale object detection benchmark for high-resolution remote sensing images. In this paper, we propose a novel benchmark dataset with more than 1 million instances and more than 40,000 images for Fine-grAined object recognItion in high-Resolution remote sensing imagery which is named as FAIR1M. We collected remote sensing images with a resolution of 0.3 m to 0.8 m from different platforms, which are spread across many countries and regions. All objects in the FAIR1M dataset are annotated with respect to 5 categories and 37 subcategories by oriented bounding boxes. Compared with existing detection datasets that are dedicated to object detection, the FAIR1M dataset has 4 particular characteristics: (1) it is much larger than other existing object detection datasets both in terms of the number of instances and the number of images, (2) it provides richer fine-grained category information for objects in remote sensing images, (3) it contains geographic information such as latitude, longitude and resolution attributes, and (4) it provides better image quality due to the use of a careful data cleaning procedure. Based on the FAIR1M dataset, we propose three fine-grained object detection and recognition tasks. Moreover, we evaluate several state-of-the-art approaches to establish baselines for future research. Experimental results indicate that the FAIR1M dataset effectively represents real remote sensing applications and is quite challenging for existing methods. Considering the fine-grained characteristics, we improve the evaluation metric and introduce the idea of hierarchy detection into the algorithms. We believe that the FAIR1M dataset will contribute to the earth observation community via fine-grained object detection in large-scale real-world scenes. FAIR1M Website: http://gaofen-challenge.com/.

Read full abstract

Large-scale Object Detection Research Articles

Related Topics

Articles published on Large-scale Object Detection

Large-Scale Object Detection in the Wild With Imbalanced Data Distribution, and Multi-Labels.

An Improved YOLOv8 Network for Multi-Object Detection with Large-Scale Differences in Remote Sensing Images

MCF-YOLOv5: A Small Target Detection Algorithm Based on Multi-Scale Feature Fusion Improved YOLOv5

WaterPairs: a paired dataset for underwater image enhancement and underwater object detection

SewerOD: A visual sewer disease detection dataset for machine learning

Content-based product image retrieval using squared-hinge loss trained convolutional neural networks

Deformable Part Region Learning and Feature Aggregation Tree Representation for Object Detection.

End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation

Object detection and activity recognition in video surveillance using neural networks

Towards Large-Scale Small Object Detection: Survey and Benchmarks.

Visual Object Detection for Privacy-Preserving Federated Learning

Guest Editorial Introduction to the Special Issue on Advanced Machine Learning Methodologies for Large-Scale Video Object Segmentation and Detection

Detection and Tracking Meet Drones Challenge.

FAIR1M: A benchmark dataset for fine-grained object recognition in high-resolution remote sensing imagery

Checkerboard Dropout: A Structured Dropout With Checkerboard Pattern for Convolutional Neural Networks

Guiding Clean Features for Object Detection in Remote Sensing Images

A Lightweight Object Detection Framework for Remote Sensing Images

Feature Rescaling and Fusion for Tiny Object Detection

Robust Algorithm for Large-Scale Gaussian Patterns Localization

Unsupervised Network Quantization via Fixed-Point Factorization.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Large-scale Object Detection Research Articles

Related Topics

Articles published on Large-scale Object Detection

Large-Scale Object Detection in the Wild With Imbalanced Data Distribution, and Multi-Labels.

An Improved YOLOv8 Network for Multi-Object Detection with Large-Scale Differences in Remote Sensing Images

MCF-YOLOv5: A Small Target Detection Algorithm Based on Multi-Scale Feature Fusion Improved YOLOv5

WaterPairs: a paired dataset for underwater image enhancement and underwater object detection

SewerOD: A visual sewer disease detection dataset for machine learning

Content-based product image retrieval using squared-hinge loss trained convolutional neural networks

Deformable Part Region Learning and Feature Aggregation Tree Representation for Object Detection.

End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation

Object detection and activity recognition in video surveillance using neural networks

Towards Large-Scale Small Object Detection: Survey and Benchmarks.

Visual Object Detection for Privacy-Preserving Federated Learning

Guest Editorial Introduction to the Special Issue on Advanced Machine Learning Methodologies for Large-Scale Video Object Segmentation and Detection

Detection and Tracking Meet Drones Challenge.

FAIR1M: A benchmark dataset for fine-grained object recognition in high-resolution remote sensing imagery

Checkerboard Dropout: A Structured Dropout With Checkerboard Pattern for Convolutional Neural Networks

Guiding Clean Features for Object Detection in Remote Sensing Images

A Lightweight Object Detection Framework for Remote Sensing Images

Feature Rescaling and Fusion for Tiny Object Detection

Robust Algorithm for Large-Scale Gaussian Patterns Localization

Unsupervised Network Quantization via Fixed-Point Factorization.