Bird's Eye View Image Research Articles

Abstract: LiDAR-based 3D place recognition is an essential component of simultaneous localization and mapping systems in multi-scene robotic applications. However, extracting discriminative and generalizable global descriptors of point clouds is still an open issue, because existing approaches make insufficient use of the information contained in LiDAR scans. In this paper, we propose a novel spatial-temporal point cloud encoding network for multiple scenes, dubbed STM-Net, to fully fuse the multi-view spatial information and temporal information of LiDAR point clouds. Specifically, we first develop a spatial feature encoding module consisting of a single-view transformer and a multi-view transformer. The module learns the correlation both within a single view and between two views by utilizing multi-layer range images generated by spherical projection and multi-layer bird's eye view images generated by top-down projection. Then, in the temporal feature encoding module, we exploit a temporal transformer to mine the temporal information in the sequential point clouds, and a NetVLAD layer is applied to aggregate features and generate sub-descriptors. Furthermore, we use a GeM pooling layer to fuse more information along the time dimension for the final global descriptors. Extensive experiments conducted on unmanned ground/surface vehicles with different LiDAR configurations indicate that our method (1) achieves better place recognition performance than state-of-the-art algorithms, (2) generalizes well to diverse scenes, (3) is robust to viewpoint changes, and (4) can operate in real time. These results demonstrate the effectiveness of the proposed approach and highlight its promising applications in multi-scene place recognition tasks.
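As an illustration of the final fusion step mentioned in the abstract, the following is a minimal NumPy sketch of generalized-mean (GeM) pooling along the time dimension. It is not the STM-Net implementation: the function name `gem_pool`, the exponent `p = 3`, the clamping to positive values, and the final L2 normalisation are all assumptions made for this example.

```python
import numpy as np


def gem_pool(sub_descriptors, p=3.0, eps=1e-6):
    """Generalized-mean (GeM) pooling over the first (time) axis.

    sub_descriptors: array of shape (T, D), one D-dimensional
        sub-descriptor per scan in the sequence (assumed layout).
    p: pooling exponent; p = 1 is average pooling, and a large p
        approaches max pooling.
    eps: lower clamp so that fractional powers stay well defined
        (assumes features are meant to be non-negative).
    """
    x = np.clip(np.asarray(sub_descriptors, dtype=np.float64), eps, None)
    pooled = np.mean(x ** p, axis=0) ** (1.0 / p)
    # L2-normalise so that descriptors can be compared by distance.
    return pooled / np.linalg.norm(pooled)


if __name__ == "__main__":
    # Four hypothetical scans with 8-dimensional sub-descriptors.
    rng = np.random.default_rng(0)
    sequence = rng.random((4, 8))
    print(gem_pool(sequence).shape)
```

The exponent controls how strongly the largest activations in the sequence dominate the fused descriptor, which is why GeM is often described as sitting between average and max pooling.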


Unmanned driving of agricultural machinery has garnered significant attention in recent years, especially with the development of precision farming and sensor technologies. Perception is central to achieving high performance at low cost. In this study, a low-cost and high-safety method was proposed for field road recognition in unmanned agricultural machinery. The approach takes low-resolution LiDAR point clouds as input and generates high-resolution point clouds and Bird's Eye View (BEV) images encoded with several basic statistics. Using a BEV representation, road detection was reduced to a single-scale problem that could be addressed with an improved U-Net++ neural network. Three enhancements were proposed for U-Net++: 1) replacing the convolutional kernel in the original U-Net++ with an Asymmetric Convolution Block (ACBlock); 2) adding a multi-branch Asymmetric Dilated Convolutional Block (MADC) in the highest semantic information layer; 3) adding an Attention Gate (AG) model to the long skip connection in the decoding stage. Experiments showed that the algorithm achieved a Mean Intersection over Union of 96.54% on the 16-channel point clouds, 7.35 percentage points higher than U-Net++. Furthermore, the average processing time of the model was about 70 ms, meeting the time requirements of unmanned driving in agricultural machinery. The proposed method can be applied to enhance the perception ability of unmanned agricultural machinery, thereby increasing the safety of field road driving.

Keywords: image segmentation, unmanned agricultural machinery, field roads, point cloud super-resolution, point cloud bird's eye view

DOI: 10.25165/j.ijabe.20231602.7941

Citation: Yang L L, Li Y B, Chang M S, Xu Y Y, Hu B B, Wang X X, et al. Recognition of field roads based on improved U-Net++ Network. Int J Agric & Biol Eng, 2023; 16(2): 171-178.
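To make the idea of a BEV image "encoded with several basic statistics" concrete, here is a minimal NumPy sketch that rasterises a point cloud into a top-down grid. The abstract does not say which statistics the authors used, so the three channels here (point count, maximum height, mean height), the function name `bev_statistics`, the grid extents, and the cell size are all assumptions for illustration.

```python
import numpy as np


def bev_statistics(points, x_range=(0.0, 40.0), y_range=(-20.0, 20.0), cell=0.2):
    """Rasterise an (N, 3) point cloud into per-cell statistics.

    Returns an (H, W, 3) array with channels:
    point count, maximum z, mean z (assumed choice of statistics).
    """
    points = np.asarray(points, dtype=np.float64)
    h = int(round((x_range[1] - x_range[0]) / cell))
    w = int(round((y_range[1] - y_range[0]) / cell))

    # Keep only the points that fall inside the grid extent.
    x, y, z = points[:, 0], points[:, 1], points[:, 2]
    keep = (x >= x_range[0]) & (x < x_range[1]) & (y >= y_range[0]) & (y < y_range[1])
    x, y, z = x[keep], y[keep], z[keep]

    # Map metric coordinates to integer cell indices.
    rows = np.minimum(((x - x_range[0]) / cell).astype(int), h - 1)
    cols = np.minimum(((y - y_range[0]) / cell).astype(int), w - 1)

    count = np.zeros((h, w))
    z_sum = np.zeros((h, w))
    z_max = np.full((h, w), -np.inf)
    # Unbuffered accumulation handles several points landing in one cell.
    np.add.at(count, (rows, cols), 1)
    np.add.at(z_sum, (rows, cols), z)
    np.maximum.at(z_max, (rows, cols), z)

    occupied = count > 0
    z_mean = np.zeros((h, w))
    z_mean[occupied] = z_sum[occupied] / count[occupied]
    z_max[~occupied] = 0.0  # empty cells get a neutral value
    return np.stack([count, z_max, z_mean], axis=-1)
```

A grid like this can be fed to a 2D segmentation network as a multi-channel image, which is the sense in which a BEV representation turns road detection into an image segmentation problem.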


Articles published on Bird's Eye View Image

LiDAR‐based place recognition for mobile robots in ground/water surface multiple scenes

Local Climate Zone Classification Using YOLOV8 Modeling in Instance Segmentation Method

Safe Autonomous Driving with Latent Dynamics and State-Wise Constraints.

A method for estimating ship berthing angle based on 2D bird's eye view point cloud

DPCN++: Differentiable Phase Correlation Network for Versatile Pose Registration.

The Software Design Overview by Processing The Recording From Bird's-Eye View Images to Determine The Crop Detection and Functionality of The Various Land Types

Exploiting Multi-Modal Fusion for Urban Autonomous Driving Using Latent Deep Reinforcement Learning

Multi‐feature subspace representation network for person re‐identification via bird's‐eye view image

Recognition of field roads based on improved U-Net++ Network

TRIMMING AND ROAD ORTHO IMAGING FOR NIGHT IMAGES BY ONBOARD HIGH SENSITIVITY CONSUMER GRADE DIGITAL CAMERAS

Vehicle Trajectory Estimation based on Drive Recorder Data

Quantitative Trimming for Ortho Imaging at Night by Onboard High Sensitivity Consumer Grade Digital Cameras

Object Detection Based on Camera-LiDAR Sensor Fusion Using Point Cloud Interpolation

CrossFusion net: Deep 3D object detection based on RGB images and point clouds in autonomous driving

Vision-based Bed Detection for Hospital Patient Monitoring System.

Support on Berthing Manoeuver by Using Image Processing

Obstacle Presentation on Bird's-Eye View Images via Omnidirectional 3D Ranging Using LiDAR for Robot Teleoperation

Generic Dynamic Environment Perception Using Smart Mobile Devices.

Accurate Mobile Urban Mapping via Digital Map-Based SLAM.

A Low-Cost Solution for Automatic Lane-Level Map Generation Using Conventional In-Car Sensors
