Abstract

Convolutional neural network (CNN), one of the branches of deep neural networks, has been widely used in image recognition, natural language processing, and other related fields with great success recently. This paper proposes a novel framework with CNN to classify objects in a point cloud captured by LiDAR on urban streets. The proposed BA-CNN algorithm is composed of five steps: (i) removing ground points, (ii) clustering objects, (iii) transforming to bearing angle images, (iv) ROI selection, and (V) identifying objects by CNN. In the first step, ground points are removed by the multi-threshold-based ground detection to reduce the processing time. Then, a flood-fill-based clustering method is used for object segmentation. Those individual point cloud objects are converted to bearing angle (BA) images. Then, a well-trained CNN is used to classify objects with BA images. The main contribution of this paper is proposing an efficient recognition method that uses the information from point clouds only. In contrast, because most 3D object classifiers use the fusion of point clouds and color images, their models are very complicated and take a colossal amount of memory to store the parameters. Since the ground point detection and object clustering process all points along with the scanline-major order and layer-major order, the proposed algorithm performs better in terms of time consumption and memory consumption. In the experiment, three scenes from KITTI dataset are used for training and testing the proposed BA-CNN classifier, and the proposed BA-CNN achieves high classification accuracy.

Highlights

  • WITH 3D surface measurement devices have been widely used, much research about 3D modeling and 3D object recognition/classification was proposed. 3D modeling is to align partial overlapping several point clouds with variant viewpoints into a standard coordinate system

  • Since road scenes can be scanned by light detection and ranging (LiDAR) sensors equipped on a vehicle and represented as point clouds, object recognition/classification is essential for the Advanced Driver Assistance System (ADAS)

  • We performed ablation studies to examine the key factors that can increase the accuracy of bearing angle (BA)-convolutional neural network (CNN)

Read more

Summary

Introduction

WITH 3D surface measurement devices have been widely used, much research about 3D modeling and 3D object recognition/classification was proposed. 3D modeling is to align partial overlapping several point clouds with variant viewpoints into a standard coordinate system. WITH 3D surface measurement devices have been widely used, much research about 3D modeling and 3D object recognition/classification was proposed. 3D modeling is to align partial overlapping several point clouds with variant viewpoints into a standard coordinate system. As for the 3D object recognition/ classification, individual objects would be separated and recognized with their partial surface from a point cloud. Since road scenes can be scanned by LiDAR sensors equipped on a vehicle and represented as point clouds, object recognition/classification is essential for the Advanced Driver Assistance System (ADAS). ADAS does not replace the role of a driver in car control but rather assists him in obtaining information on vehicle operation and its surrounding environment. In ADAS, object recognition usually relies on computer vision methods involving variant sensors, including cameras, sonars, radars, LiDAR (Light Detection and Ranging), and so on

Objectives
Methods
Findings
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call