Abstract
This paper introduces an efficient object detection network named Trident-You Only Look Once (YOLO), which is designed for mobile devices with limited computing power. The new architecture builds on YOLO v4-tiny. The authors redesign the network structure and propose a trident feature pyramid network (Trident-FPN), which can improve the precision and recall of lightweight object detection. Specifically, Trident-FPN adds only a small number of floating-point operations (FLOPs) to the computational cost and gives the model a multi-scale feature map, which significantly improves lightweight object detection performance. To enlarge the receptive field of the network with the fewest FLOPs, this paper redesigns the receptive field block (RFB) and spatial pyramid pooling (SPP) layer and proposes tinier cross-stage partial RFBs and smaller cross-stage partial SPPs. This paper presents extensive experiments, and Trident-YOLO shows strong performance compared to other popular models on the PASCAL VOC and MS COCO datasets. On the MS COCO and PASCAL VOC 2007 test sets, the mean average precision (mAP) of Trident-YOLO improves by 4.5% and 5.0%, respectively. Trident-YOLO also reduces the network size by more than 54.4% compared to YOLO v4-tiny. With a 23.7% reduction in FLOPs, the frames per second (FPS) improves by 1.9 on an Nvidia Jetson Xavier NX.