Abstract

Data augmentation is an important pre‐processing step for object detection in 2D images and 3D point clouds. However, studies on multimodal data augmentation are extremely limited compared with single‐modal work. Moreover, simultaneously ensuring consistency and rationality when pasting both image and point cloud samples is a major challenge for multimodal methods. In this study, a novel multimodal data augmentation method based on ground truth sampling (GT sampling) is proposed for generating content‐rich synthetic scenes. A GT database and a scene ground database are first built from the raw training set, after which the context of the image and point cloud is used to guide the paste location and the filtering strategy for the GT samples. The proposed method avoids the cluttered features caused by randomly pasting samples; the image context information helps the model learn the correlation between objects and their environment more comprehensively, and the point cloud context information reduces occlusion for long‐distance objects. The effectiveness of the proposed strategy is demonstrated on the publicly available KITTI dataset. Using the multimodal 3D detector MVXNet as an implementation tool, our experiments evaluate different superimposition strategies, ranging from context‐free sample pasting to context‐guided construction of new training scenes. Compared with existing GT sampling methods, our method exhibits a relative performance improvement of 15% on benchmark datasets. In ablation studies, our sample pasting strategy achieves a +2.81% gain over previous work. In conclusion, considering the multimodal context of modelled objects is crucial for placing them in the correct environment.
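The pipeline the abstract describes — draw objects from a GT database and paste them only where the scene context admits them — can be sketched minimally as below. This is an illustrative stand-in, not the paper's implementation: the sample format, the `valid_ground` location list, and the overlap check are all simplified assumptions replacing the image/point-cloud context cues the method actually uses.

```python
import random

def context_guided_paste(scene_points, gt_database, valid_ground,
                         max_samples=3, seed=0):
    """Paste up to max_samples GT objects at context-valid ground locations.

    scene_points: list of (x, y, z) points already in the scene.
    gt_database:  list of dicts {"name": str, "points": [(x, y, z), ...]},
                  each sample stored relative to its own origin.
    valid_ground: list of (x, y) locations the (hypothetical) context
                  filter has judged free ground -- a stand-in for the
                  image and point-cloud context checks in the paper.
    """
    rng = random.Random(seed)
    placed = []
    occupied = set()
    for sample in rng.sample(gt_database, min(max_samples, len(gt_database))):
        # Filter step: reject locations already taken by an earlier paste,
        # so pasted samples cannot clutter or overlap one another.
        candidates = [loc for loc in valid_ground if loc not in occupied]
        if not candidates:
            break
        dx, dy = rng.choice(candidates)
        occupied.add((dx, dy))
        # Paste step: translate the sample's points to the chosen location.
        scene_points = scene_points + [(x + dx, y + dy, z)
                                       for (x, y, z) in sample["points"]]
        placed.append({"name": sample["name"], "location": (dx, dy)})
    return scene_points, placed
```

A full system would also paste the matching image patch at the projected 2D location and apply the context-based filtering described in the abstract; the sketch keeps only the point-cloud side to show the control flow.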

