Abstract
Rotation-invariant feature extraction is crucial for object detection in Very High Resolution (VHR) optical remote sensing images. Although convolutional neural networks (CNNs) are good at extracting the translation-invariant features and have been widely applied in computer vision, it is still a challenging problem for CNNs to extract rotation-invariant features in VHR optical remote sensing images. In this paper we present a novel Double-Net with sample pairs from the same class as inputs to improve the performance of object detection and classification in VHR optical remote sensing images. Specifically, the proposed Double-Net contains multiple channels of CNNs in which each channel refers to a specific rotation direction and all CNNs share identical weights. Based on the output features of all channels, multiple instance learning algorithm is employed to extract the final rotation-invariant features. Experimental results on two publicly available benchmark datasets, namely Mnist-rot-12K and NWPU VHR-10, demonstrate that the presented Double-Net outperforms existing approaches on the performance of rotation-invariant feature extraction, which is especially effective under the situation of small training samples.
Highlights
Extracting rotation-invariant features is crucial for object detection in very high resolution (VHR) optical remote sensing images
Histogram of oriented gradient (HOG) feature [6] has widely used in remote sensing image classification and object detection
HOG has been extended to rotation-invariant version and applied to object detection in VHR optical remote sensing images [7]
Summary
Extracting rotation-invariant features is crucial for object detection in very high resolution (VHR) optical remote sensing images. SIFT is able to extract rotation-invariant features and transformation-invariant features Such a local feature descriptor fails to represent the whole object in object detection and classification tasks. Histogram of oriented gradient (HOG) feature [6] has widely used in remote sensing image classification and object detection. HOG has been extended to rotation-invariant version and applied to object detection in VHR optical remote sensing images [7]. These handcrafted or shallow-learning-based features, such as SIFT, HOG, and bag-of-words (BoW) [8], have gained
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.