Abstract
The single shared head detector is mostly used for both classification and regression tasks in object detection networks. However, the two tasks focus on different features, and the features extracted from a single head may inhibit each other’s expression. In 2D detection, a decoupled representation with multi-head structure works better than the shared one for both tasks in the R-CNN [1] blocks. Since the RCNN structure is not available for single-stage detectors, the decoupling must be implemented in a slightly different way. Therefore, we propose a double-head structure based on single-stage point cloud object detection, with two decoupled heads to describe the classification and regression tasks in the high-level feature map, respectively. One of the heads is used to extract classification information, while the other one is used for regression information. In particular, the regression information also contributes to the classification task, and we denote the category result by the union of the classification values predicted by both the regression head and the classification head. Our approach can improve the AP of the baseline in the car detection task on the kitti dataset.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.