Small Object Detection Research Articles

In unmanned aerial vehicles (UAVs) detection, challenges such as occlusion, complex backgrounds, motion blur, and inference time often lead to false detections and missed detections. General object detection frameworks encounter difficulties in adequately tackling these challenges, leading to substantial information loss during network downsampling, inadequate feature fusion, and being unable to meet real-time requirements. In this paper, we propose a Real-Time Small Object Detection YOLO (RTSOD-YOLO) model to tackle the various challenges faced in UAVs detection. We further enhance the adaptive nature of the Adown module by incorporating an adaptive spatial attention mechanism. This mechanism processes the downsampled feature maps, enabling the model to better focus on key regions. Secondly, to address the issue of insufficient feature fusion, we employ combined serial and parallel triple feature encoding (TFE). This approach fuses scale-sequence features from both shallow features and twice-encoded features, resulting in a new small-scale object detection layer. While enhancing the global context awareness of the existing detection layers, this also enriches the small-scale object detection layer with detailed information. Since rich redundant features often ensure a comprehensive understanding of the input, which is a key characteristic of deep neural networks, we propose a more efficient redundant feature generation module. This module generates more feature maps with fewer parameters. Additionally, we introduce reparameterization techniques to compensate for potential feature loss while further improving the model’s inference speed. Experimental results demonstrate that our proposed RTSOD-YOLO achieves superior detection performance, with mAP50/mAP50:95 reaching 97.3%/51.7%, which represents improvement of 3%/3.5% over YOLOv8, and 2.6%/0.1% higher than YOLOv10. Additionally, it has the lowest parameter count and FLOPs, making it highly efficient in terms of computational resources.

Read full abstract

Scaphoid fractures, particularly occult and non-displaced fractures, are difficult to detect using traditional X-ray methods because of their subtle appearance and variability in bone density. This study proposes a two-stage CNN approach to detect and classify scaphoid fractures using anterior-posterior (AP) and lateral (LA) X-ray views for more accurate diagnosis. This study emphasizes the use of multi-view X-ray images (AP and LA views) to improve fracture detection and classification. The multi-view fusion module helps integrate information from both views to enhance detection accuracy, particularly for occult fractures that may not be visible in a single view. The proposed method includes two stages, which are stage 1: detect the scaphoid bone using Faster RCNN and a Feature Pyramid Network (FPN) for region proposal and small object detection. The detection accuracy for scaphoid localization is 100%, with Intersection over Union (IoU) scores of 0.8662 for AP views and 0.8478 for LA views. And stage 2: perform fracture classification using a ResNet backbone and FPN combined with a multi-view fusion module to combine features from both AP and LA views. This stage achieves a classification accuracy of 89.94%, recall of 87.33%, and precision of 90.36%. The proposed model performs well in both scaphoid bone detection and fracture classification. The multi-view fusion approach significantly improves recall and accuracy in detecting fractures compared to single-view approaches. In scaphoid detection, both AP and LA views achieved 100% detection accuracy. In fracture detection, using multi-view fusion, the accuracy for AP views reached 87.16%, and for LA views, it reached 83.83%. The multi-view fusion model effectively improves the detection of scaphoid fractures, particularly in cases of occult and non-displaced fractures. The model provides a reliable, automated approach to assist clinicians in detecting and diagnosing scaphoid fractures more efficiently.

Read full abstract

Small Object Detection Research Articles

Articles published on Small Object Detection

ATBHC-YOLO: aggregate transformer and bidirectional hybrid convolution for small object detection

GreenFruitDetector: Lightweight green fruit detector in orchard environment.

Mamba-UAV-SegNet: A Multi-Scale Adaptive Feature Fusion Network for Real-Time Semantic Segmentation of UAV Aerial Imagery

A Reparameterization Feature Redundancy Extract Network for Unmanned Aerial Vehicles Detection

Object Detection and Tracking in Maritime Environments in Case of Person-Overboard Scenarios: An Overview

CDANet: a small object detection model for unmanned aerial vehicle images based on cross-layer guidance and skip dilated fusion

Vision-guided crack identification and size quantification framework for dam underwater concrete structures

Efficient Small Object Detection You Only Look Once: A Small Object Detection Algorithm for Aerial Images.

Dynamic feature and context enhancement network for faster detection of small objects

DeployFusion: A Deployable Monocular 3D Object Detection with Multi-Sensor Information Fusion in BEV for Edge Devices.

A Heatmap-Supplemented R-CNN Trained Using an Inflated IoU for Small Object Detection

MS-YOLO: A Lightweight and High-Precision YOLO Model for Drowning Detection.

The Detection and Classification of Scaphoid Fractures in Radiograph by Using a Convolutional Neural Network.

YOLO-DHGC: Small Object Detection Using Two-Stream Structure with Dense Connections.

BGF-YOLOv10: Small Object Detection Algorithm from Unmanned Aerial Vehicle Perspective Based on Improved YOLOv10.

SOD-YOLO: A lightweight small object detection framework

A Dual Detection Head YOLO Model With Its Application in Wheat Ear Recognition

AI-empowered Search Drone: Advanced AI-based Tiny Object Search Drone for Broad Areas

CSSDet: small object detection via cross-scale feature enhancement on drone-view images

YOLO-ESL: An Enhanced Pedestrian Recognition Network Based on YOLO

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Small Object Detection Research Articles

Articles published on Small Object Detection

ATBHC-YOLO: aggregate transformer and bidirectional hybrid convolution for small object detection

GreenFruitDetector: Lightweight green fruit detector in orchard environment.

Mamba-UAV-SegNet: A Multi-Scale Adaptive Feature Fusion Network for Real-Time Semantic Segmentation of UAV Aerial Imagery

A Reparameterization Feature Redundancy Extract Network for Unmanned Aerial Vehicles Detection

Object Detection and Tracking in Maritime Environments in Case of Person-Overboard Scenarios: An Overview

CDANet: a small object detection model for unmanned aerial vehicle images based on cross-layer guidance and skip dilated fusion

Vision-guided crack identification and size quantification framework for dam underwater concrete structures

Efficient Small Object Detection You Only Look Once: A Small Object Detection Algorithm for Aerial Images.

Dynamic feature and context enhancement network for faster detection of small objects

DeployFusion: A Deployable Monocular 3D Object Detection with Multi-Sensor Information Fusion in BEV for Edge Devices.

A Heatmap-Supplemented R-CNN Trained Using an Inflated IoU for Small Object Detection

MS-YOLO: A Lightweight and High-Precision YOLO Model for Drowning Detection.

The Detection and Classification of Scaphoid Fractures in Radiograph by Using a Convolutional Neural Network.

YOLO-DHGC: Small Object Detection Using Two-Stream Structure with Dense Connections.

BGF-YOLOv10: Small Object Detection Algorithm from Unmanned Aerial Vehicle Perspective Based on Improved YOLOv10.

SOD-YOLO: A lightweight small object detection framework

A Dual Detection Head YOLO Model With Its Application in Wheat Ear Recognition

AI-empowered Search Drone: Advanced AI-based Tiny Object Search Drone for Broad Areas

CSSDet: small object detection via cross-scale feature enhancement on drone-view images

YOLO-ESL: An Enhanced Pedestrian Recognition Network Based on YOLO