Abstract

Object detection has drawn the attention of many researchers due to its wide application in computer vision-related applications. In this paper, a novel model is proposed for object detection. Firstly, a new neck is designed for the proposed detection model, including an efficient SPPNet (Spatial Pyramid Pooling Network), a modified NLNet (Non Local Network) and a lightweight adaptive feature fusion module. Secondly, the detection head with double residual branch structure is presented to reduce the delay of a decoupled head and improve the detection ability. Finally, these improvements are embedded in YOLOX as plug-and-play modules for forming a high-performance detector, EYOLOX (EfficientYOLOX). Extensive experiments demonstrate that the EYOLOX achieves significant improvements, which increases YOLOX-s from 40.5% to 42.2% AP on the MS COCO dataset with a single GPU. Moreover, the performance of the detection of EYOLOX also outperforms YOLOv6 and some SOTA methods with the same number of parameters and GFLOPs. In particular, EYOLOX has only been trained on the COCO-2017 dataset without using any other datasets, and only the pre-training weights of the backbone part are loaded.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.