Abstract

AbstractIn this paper, we present new feature encoding methods for detection of 3D objects in point clouds. We used a graph neural network (GNN) for detection of 3D objects, namely cars, pedestrians, and cyclists. Feature encoding is one of the important steps in detection of 3D objects. The dataset used is point cloud data which is irregular and unstructured, and it needs to be encoded in such a way that ensures better feature encapsulation. Earlier works have used relative distance as one of the methods to encode the features. These methods are not resistant to rotation variance problems in graph neural networks. We have included angular-based measures while performing feature encoding in graph neural networks. Along with that, we have performed a comparison between other methods like absolute, relative, Euclidean distances, and a combination of the angle and relative methods. The model is trained and evaluated on the subset of the KITTI object detection benchmark dataset under resource constraints. Our results demonstrate that a combination of angle measures and relative distance has performed better than other methods. In comparison to the baseline method (relative), it achieved better performance. We also performed time analysis of various feature encoding methods.KeywordsFeature encodingObject detectionLiDARKITTIGNN

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.