Abstract

This paper investigates neural network architectures that fuse feature-level data from radar and vision sensors in order to improve automotive environment perception for advanced driver assistance systems. Fusion is performed with occupancy grids, which encode sensor-specific information mapped from each sensor's detection list. The fusion step is evaluated on three types of neural networks: (1) fully convolutional, (2) auto-encoder, and (3) auto-encoder with skip connections. These networks are trained to fuse radar and camera occupancy grids, with ground truth obtained from lidar scans. A detailed analysis of network architectures and parameters is performed. Results are compared to classical Bayesian occupancy fusion using typical evaluation metrics for pixel-wise classification tasks, such as intersection over union and pixel accuracy. This paper shows that grid fusion of feature-level sensor data is possible with the proposed system architecture. The auto-encoder architectures in particular show significant improvements in the evaluation metrics compared to the classical Bayesian fusion method.
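To make the baseline and the metrics concrete, the following is a minimal sketch (not the authors' code) of the two evaluation ingredients named above: classical Bayesian occupancy fusion via a cell-wise log-odds update, and the pixel-wise metrics intersection over union and pixel accuracy. The grid size, the 0.5 occupancy threshold, and all function names are illustrative assumptions.

```python
# Sketch of Bayesian occupancy-grid fusion and pixel-wise evaluation metrics.
# Assumptions: grids hold occupancy probabilities in [0, 1]; cells with
# probability >= 0.5 count as "occupied" when computing the metrics.
import numpy as np

def bayesian_fusion(p_radar: np.ndarray, p_camera: np.ndarray) -> np.ndarray:
    """Fuse two occupancy-probability grids cell-wise in log-odds space."""
    eps = 1e-6  # avoid log(0) at fully certain cells
    l_radar = np.log(np.clip(p_radar, eps, 1 - eps) / np.clip(1 - p_radar, eps, 1 - eps))
    l_camera = np.log(np.clip(p_camera, eps, 1 - eps) / np.clip(1 - p_camera, eps, 1 - eps))
    l_fused = l_radar + l_camera            # independent-sensor log-odds update
    return 1.0 / (1.0 + np.exp(-l_fused))   # back to probabilities

def iou(pred: np.ndarray, truth: np.ndarray, thresh: float = 0.5) -> float:
    """Intersection over union of the 'occupied' class."""
    p, t = pred >= thresh, truth >= thresh
    union = np.logical_or(p, t).sum()
    return float(np.logical_and(p, t).sum() / union) if union else 1.0

def pixel_accuracy(pred: np.ndarray, truth: np.ndarray, thresh: float = 0.5) -> float:
    """Fraction of grid cells classified correctly."""
    return float(((pred >= thresh) == (truth >= thresh)).mean())

# Example on random grids; real inputs would be radar and camera occupancy
# grids with lidar-derived ground truth, as described in the abstract.
rng = np.random.default_rng(0)
radar, camera, lidar = (rng.uniform(size=(128, 128)) for _ in range(3))
fused = bayesian_fusion(radar, camera)
print(f"IoU: {iou(fused, lidar):.3f}, pixel accuracy: {pixel_accuracy(fused, lidar):.3f}")
```

The neural fusion networks in the paper replace the log-odds update with a learned mapping but would be scored with the same pixel-wise metrics.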
