Robust 3D Point Cloud Recognition: Enhancing Robustness with GPT-4 and CLIP Integration

Lei Pan, Junhui Li, Wuyang Luan, Yuan Zhen, Chang Xu

doi:10.1142/s0218126624502955

Abstract

In recent years, deep neural networks have achieved significant success in 3D point cloud recognition tasks. However, their performance still degrades substantially when the input data is corrupted, so improving model robustness and generalization is crucial. In this work, we propose a novel framework that combines GPT and CLIP models to enhance the robustness of existing point cloud classification models. The framework has two main modules: the Text-Image Fusion Module, which includes a GPT-Driven TextGen Processor and FocalView Projection, and the Dual-Path Intelligent Adapter Module. First, within the Text-Image Fusion Module, the GPT-Driven TextGen Processor leverages GPT-4’s capabilities to generate detailed textual descriptions tailored to the intricacies of point clouds, while FocalView Projection dynamically selects viewpoints based on attention maps, enhancing the two-dimensional representations of three-dimensional point clouds. Second, the Dual-Path Intelligent Adapter Module achieves fine-tuning and feature adaptation by combining internal and external adapters. Additionally, during fine-tuning, we employ a variant of Projected Gradient Descent (PGD) adversarial training, named VPGD, to increase the model’s resilience to adversarial perturbations. Our approach achieves state-of-the-art results on robust 3D point cloud recognition datasets such as ModelNet40-C and ScanObjectNN-C.
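
The abstract does not specify how VPGD differs from standard PGD, so the sketch below illustrates only the standard L-infinity PGD attack that such adversarial training builds on, applied to a point cloud. Everything in it is an assumption made for illustration: the toy classifier (mean of the points followed by a linear layer), the function names `loss_and_grad` and `pgd_attack`, and the values of `eps`, `alpha`, and `steps`. It is not the authors' implementation.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def loss_and_grad(points, label, weights):
    """Cross-entropy loss of a toy classifier and its gradient per point.

    Toy model (hypothetical, for illustration only): average the N points
    into one 3-vector, then apply a linear layer `weights` (C rows of 3).
    """
    n = len(points)
    mean = [sum(p[k] for p in points) / n for k in range(3)]
    logits = [sum(w[k] * mean[k] for k in range(3)) for w in weights]
    probs = softmax(logits)
    loss = -math.log(probs[label])
    # dL/dlogits = probs - onehot(label); chain through the linear layer
    # and the mean, so every point receives the same gradient divided by N.
    dlogits = [p - (1.0 if c == label else 0.0) for c, p in enumerate(probs)]
    dmean = [sum(dlogits[c] * weights[c][k] for c in range(len(weights)))
             for k in range(3)]
    grad = [[g / n for g in dmean] for _ in points]
    return loss, grad


def sign(x):
    """Return -1, 0, or 1 according to the sign of x."""
    return (x > 0) - (x < 0)


def pgd_attack(points, label, weights, eps=0.05, alpha=0.01, steps=10, seed=0):
    """Standard L-infinity PGD: random start, signed ascent step, projection."""
    rng = random.Random(seed)
    # Random start inside the eps-ball around the clean point cloud.
    delta = [[rng.uniform(-eps, eps) for _ in range(3)] for _ in points]
    for _ in range(steps):
        perturbed = [[p[k] + d[k] for k in range(3)]
                     for p, d in zip(points, delta)]
        _, grad = loss_and_grad(perturbed, label, weights)
        # Step in the direction that increases the loss, then clip each
        # coordinate of the perturbation back into [-eps, eps].
        delta = [[max(-eps, min(eps, d[k] + alpha * sign(g[k])))
                  for k in range(3)]
                 for d, g in zip(delta, grad)]
    return [[p[k] + d[k] for k in range(3)] for p, d in zip(points, delta)]
```

In adversarial training, each training batch would be passed through an attack like `pgd_attack` and the model would then be updated on the perturbed points. How VPGD modifies this loop is not stated in the abstract.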

More From: Journal of Circuits, Systems and Computers

Similar Papers

Unsupervised Point Cloud Representation Learning with Deep Neural Networks: A Survey.
Aoran Xiao ... Ling Shao
IEEE Transactions on Pattern Analysis and Machine Intelligence | VOL. PP
01 Sep 2023

ROAD SEGMENTATION ON LOW RESOLUTION LIDAR POINT CLOUDS FOR AUTONOMOUS VEHICLES
L Gigli ... N Vemuri
ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences | VOL. V-2-2020
03 Aug 2020

Local voxelized structure for 3D binary feature representation and robust registration of point clouds from low-cost sensors
Siwen Quan ... Tao Ma
Information Sciences | VOL. 444
06 Mar 2018

CardioDefense: Defending against adversarial attack in ECG classification with adversarial distillation training
Jiahao Shao ... Shenda Hong
Biomedical Signal Processing and Control | VOL. 91
05 Jan 2024
