Point-wise Features Research Articles

Multiple Object Tracking (MOT) is a significant task in autonomous driving. However, relying on a single sensor is not robust enough, because any one modality tends to fail in some challenging situations. Texture information from RGB cameras and 3D structure information from Light Detection and Ranging (LiDAR) each have advantages under different circumstances, so fusing features from multiple modalities contributes to the learning of discriminative features. Achieving effective feature fusion is nontrivial, however, because the two modalities carry completely different kinds of information. Previous fusion methods usually fuse the top-level features after the backbones have extracted features from each modality. The fusion therefore happens only once, which limits the information interaction between modalities. In this paper, we propose multi-scale interactive query and fusion between pixel-wise and point-wise features to obtain more discriminative features. In addition, an attention mechanism is used to conduct soft feature fusion between multiple pixels and points, which avoids the inaccurate matching problems of previous single pixel-point fusion methods. We introduce PointNet++ to obtain multi-scale deep representations of point clouds and adapt it to our proposed interactive feature fusion between multi-scale features of images and point clouds. Through the interaction module, each modality can integrate more complementary information from the other modality. We also explore the effectiveness of pre-training on each single modality and fine-tuning on the fusion-based model. Our method achieves 90.32% MOTA and 72.44% HOTA on the KITTI benchmark and outperforms other approaches that do not use multi-scale soft feature fusion.
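The idea of soft fusion between one point and several candidate pixels can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `soft_fuse`, the scaled dot-product scoring, and the residual addition are all assumptions chosen for illustration, and a real model would use learned projections over multi-scale feature maps.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def soft_fuse(point_feat, pixel_feats):
    """Fuse one point feature with several candidate pixel features.

    Hypothetical helper: the point feature acts as the query and the
    pixel features act as keys and values (no learned projections).
    """
    scale = math.sqrt(len(point_feat))
    # Attention weights: how relevant each candidate pixel is to the point.
    weights = softmax([dot(point_feat, p) / scale for p in pixel_feats])
    # Weighted sum of pixel features instead of a single hard match.
    attended = [
        sum(w * p[i] for w, p in zip(weights, pixel_feats))
        for i in range(len(point_feat))
    ]
    # Residual addition (an assumption) keeps the original point feature.
    fused = [a + b for a, b in zip(point_feat, attended)]
    return fused, weights


point = [1.0, 0.0]
pixels = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
fused, weights = soft_fuse(point, pixels)
```

Because every candidate pixel contributes in proportion to its weight, a slightly misaligned projection of the point onto the image degrades the fused feature gradually rather than selecting a wrong pixel outright.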

Articles published on Point-wise Features

SuPrNet: Super Proxy for 4D occupancy forecasting

A Method for Roof Wireframe Reconstruction Based on Self-Supervised Pretraining

PatchDPCC: A Patchwise Deep Compression Framework for Dynamic Point Clouds

AttentionVote: A coarse-to-fine voting network of anchor-free 6D pose estimation on point cloud for robotic bin-picking application

Interactive Multi-Scale Fusion of 2D and 3D Features for Multi-Object Vehicle Tracking

GCMTN: Low-Overlap Point Cloud Registration Network Combining Dense Graph Convolution and Multilevel Interactive Transformer

SEGANet: 3D object detection with shape-enhancement and geometry-aware network

DGCN-ED: dynamic graph convolutional networks with encoder–decoder structure and its application for airborne LiDAR point classification

Multi-Source Features Fusion Single Stage 3D Object Detection With Transformer

STORM: Structure-Based Overlap Matching for Partial Point Cloud Registration.

DTSSD: Dual-Channel Transformer-Based Network for Point-Based 3D Object Detection

Point cloud completion via structured feature maps using a feedback network

Segment as Points for Efficient and Effective Online Multi-Object Tracking and Segmentation.

Unsupervised Class-Agnostic Instance Segmentation of 3D LiDAR Data for Autonomous Vehicles

SSA3D: Semantic Segmentation Assisted One-Stage Three-Dimensional Vehicle Object Detection

Simultaneous Pose Estimation and Velocity Estimation of an Ego Vehicle and Moving Obstacles Using LiDAR Information Only

SMS-Net: Sparse multi-scale voxel feature aggregation network for LiDAR-based 3D object detection

Fast LiDAR R-CNN: Residual Relation-Aware Region Proposal Networks for Multiclass 3-D Object Detection

PIFNet: 3D Object Detection Using Joint Image and Point Cloud Features for Autonomous Driving

Contrastive Instance Association for 4D Panoptic Segmentation Using Sequences of 3D LiDAR Scans
