Three-dimensional Object Detection Research Articles

Three-dimensional object detection is a pivotal research topic in computer vision, aiming to identify and locate objects in three-dimensional space. It has wide applications in various fields such as geoscience, autonomous driving, and drone navigation. The rapid development of deep learning techniques has led to significant advancements in 3D object detection. However, with the increasing complexity of applications, 3D object detection faces a series of challenges such as data imbalance and the effectiveness of network models. Specifically, in an experiment, our investigation revealed a notable discrepancy in the LiDAR reflection intensity within a point cloud scene, with stronger intensities observed in proximity and weaker intensities observed at a distance. Furthermore, we have also noted a substantial disparity in the number of foreground points compared to the number of background points. Especially in 3D object detection, the foreground point is more important than the background point, but it is usually downsampled without discrimination in the subsequent processing. With the objective of tackling these challenges, we work from both data and network perspectives, designing a feature alignment filtering algorithm and a two-stage 3D object detection network. Firstly, in order to achieve feature alignment, we introduce a correction equation to decouple the relationship between distance and intensity and eliminate the attenuation effect of intensity caused by distance. Then, a background point filtering algorithm is designed by using the aligned data to alleviate the problem of data imbalance. At the same time, we take into consideration the fact that the accuracy of semantic segmentation plays a crucial role in 3D object detection. Therefore, we propose a two-stage deep learning network that integrates spatial and spectral information, in which a feature fusion branch is designed and embedded in the semantic segmentation backbone. Through a series of experiments on the KITTI dataset, it is proven that the proposed method achieves the following average precision (AP_R40) values for easy, moderate, and hard difficulties, respectively: car (Iou 0.7)—89.23%, 80.14%, and 77.89%; pedestrian (Iou 0.5)—52.32%, 45.47%, and 38.78%; and cyclist (Iou 0.5)—76.41%, 61.92%, and 56.39%. By emphasizing both data quality optimization and efficient network architecture, the performance of the proposed method is made comparable to other state-of-the-art methods.

Read full abstract

Autonomous vehicles (AVs) play a crucial role in enhancing urban mobility within the context of a smarter and more connected urban environment. Three-dimensional object detection in AVs is an essential task for comprehending the driving environment to contribute to their safe use in urban environments. Existing 3D LiDAR object detection systems lose many critical point features during the down-sampling process and neglect the crucial interactions between local features, providing insufficient semantic information and leading to subpar detection performance. We propose a dynamic feature abstraction with self-attention (DFA-SAT), which utilizes self-attention to learn semantic features with contextual information by incorporating neighboring data and focusing on vital geometric details. DFA-SAT comprises four modules: object-based down-sampling (OBDS), semantic and contextual feature extraction (SCFE), multi-level feature re-weighting (MLFR), and local and global features aggregation (LGFA). The OBDS module preserves the maximum number of semantic foreground points along with their spatial information. SCFE learns rich semantic and contextual information with respect to spatial dependencies, refining the point features. MLFR decodes all the point features using a channel-wise multi-layered transformer approach. LGFA combines local features with decoding weights for global features using matrix product keys and query embeddings to learn spatial information across each channel. Extensive experiments using the KITTI dataset demonstrate significant improvements over the mainstream methods SECOND and PointPillars, improving the mean average precision (AP) by 6.86% and 6.43%, respectively, on the KITTI test dataset. DFA-SAT yields better and more stable performance for medium and long distances with a limited impact on real-time performance and model parameters, ensuring a transformative shift akin to when automobiles replaced conventional transportation in cities.

Read full abstract

Three-dimensional Object Detection Research Articles

Related Topics

Articles published on Three-dimensional Object Detection

EFMF-pillars: 3D object detection based on enhanced features and multi-scale fusion

3D Object Detection via Residual SqueezeDet

Spatial awareness enhancement based single-stage anchor-free 3D object detection for autonomous driving

Vehicle Behavior Discovery and Three-Dimensional Object Detection and Tracking Based on Spatio-Temporal Dependency Knowledge and Artificial Fish Swarm Algorithm.

SOA: Seed point offset attention for indoor 3D object detection in point clouds

Rapid identification of moldy peanuts based on three-dimensional hyperspectral object detection

A Multi-Sensor 3D Detection Method for Small Objects

NeXtFusion: Attention-Based Camera-Radar Fusion Network for Improved Three-Dimensional Object Detection and Tracking

Edge-Triggered Three-Dimensional Object Detection Using a LiDAR Ring.

Equal Emphasis on Data and Network: A Two-Stage 3D Point Cloud Object Detection Algorithm with Feature Alignment

Accurate Representation Modeling and Interindividual Constraint Learning for Roadside Three-Dimensional Object Detection

Singular and Multimodal Techniques of 3D Object Detection: Constraints, Advancements and Research Direction

Addressing the Gaps of IoU Loss in 3D Object Detection with IIoU

A LiDAR Multi-Object Detection Algorithm for Autonomous Driving

ConCs-Fusion: A Context Clustering-Based Radar and Camera Fusion for Three-Dimensional Object Detection

Three-dimensional object detection with spatial-semantic features of point clouds

CAF-RCNN: multimodal 3D object detection with cross-attention

PVONet: point-voxel-based semi-supervision monocular three-dimensional object detection using LiDAR camera systems

DFA-SAT: Dynamic Feature Abstraction with Self-Attention-Based 3D Object Detection for Autonomous Driving

CAMRL: A Joint Method of Channel Attention and Multidimensional Regression Loss for 3D Object Detection in Automated Vehicles

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Three-dimensional Object Detection Research Articles

Related Topics

Articles published on Three-dimensional Object Detection

EFMF-pillars: 3D object detection based on enhanced features and multi-scale fusion

3D Object Detection via Residual SqueezeDet

Spatial awareness enhancement based single-stage anchor-free 3D object detection for autonomous driving

Vehicle Behavior Discovery and Three-Dimensional Object Detection and Tracking Based on Spatio-Temporal Dependency Knowledge and Artificial Fish Swarm Algorithm.

SOA: Seed point offset attention for indoor 3D object detection in point clouds

Rapid identification of moldy peanuts based on three-dimensional hyperspectral object detection

A Multi-Sensor 3D Detection Method for Small Objects

NeXtFusion: Attention-Based Camera-Radar Fusion Network for Improved Three-Dimensional Object Detection and Tracking

Edge-Triggered Three-Dimensional Object Detection Using a LiDAR Ring.

Equal Emphasis on Data and Network: A Two-Stage 3D Point Cloud Object Detection Algorithm with Feature Alignment

Accurate Representation Modeling and Interindividual Constraint Learning for Roadside Three-Dimensional Object Detection

Singular and Multimodal Techniques of 3D Object Detection: Constraints, Advancements and Research Direction

Addressing the Gaps of IoU Loss in 3D Object Detection with IIoU

A LiDAR Multi-Object Detection Algorithm for Autonomous Driving

ConCs-Fusion: A Context Clustering-Based Radar and Camera Fusion for Three-Dimensional Object Detection

Three-dimensional object detection with spatial-semantic features of point clouds

CAF-RCNN: multimodal 3D object detection with cross-attention

PVONet: point-voxel-based semi-supervision monocular three-dimensional object detection using LiDAR camera systems

DFA-SAT: Dynamic Feature Abstraction with Self-Attention-Based 3D Object Detection for Autonomous Driving

CAMRL: A Joint Method of Channel Attention and Multidimensional Regression Loss for 3D Object Detection in Automated Vehicles