Abstract

The deep neural networks has been developed fast and shown great successes in many significant fields, such as smart surveillance, self-driving and face recognition. The detection of the object with multi-scale and multi-aspect-ratio is still the key problem. In this study, the authors propose a bottom-up feature pyramid network, coordinating with multi-scale feature representation and multi-aspect-ratio anchor generation. Firstly, the multi-scale feature representation is formed by a set of fully convolutional layers which is catenated after the backbone network. Secondly, in order to link the multi-scale feature, the deconvolutional layer is involving after each multi-scale feature map. Thirdly, to tackle the problem of adopting object with different aspect ratios, the anchors on each multi-scale feature map are generated by six shapes. The proposed method is evaluated on PASCAL visual object detection dataset and reach the accuracy of 80.5%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.