Abstract
The accuracy of visual object detection, which es-timates locations and classes of target objects in input images, has been drastically improved by the rapid advancements of deep convolutional neural networks (CNNs). The evaluation of existing methods based on CNNs is usually conducted using major datasets such as MS-COCO and PASCAL-VOC, and these datasets include several sizes of target objects. The accuracy of detecting of larger objects has become remarkable via recent methods; however, it remains difficult for recent CNNs to accurately detect small objects. To address this problem, this study investigates how to improve the accuracy of small object detection using CNNs. For the investigation, two types of datasets that solely comprise small target objects were created: the Bird and SAVMAP datasets that solely include flying objects in the sky and mammals in the savannah, respectively. Experimental results obtained with the datasets indicate that the input size, depth of CNN layers, and surrounding context of target objects were important factors for small object detection. Furthermore, these results demonstrate that EfficientDet-DO achieved accuracies of 0.6585 and 0.6501 for the Bird and SAVMAP datasets, respectively.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.