Abstract
For remote sensing (RS) scene classification, most of the existing techniques annotate a scene image with merely a single semantic label. However, with the recent advance of remote sensing technology, more abundant information is contained in high-resolution scenes, making a scene image having multiple semantic meanings (i.e., multilabels). Since multi-label RS scene image annotation is a domain full of challenges due to the ambiguities between complicated scene contents and labels, it motivates us to present a novel algorithm which is based on multi-bag integration. First, to describe the semantic content of RS scene image, we propose to partition a scene image into image patches, defined by a regular grid, and extract the heterogeneous features within each. Second, two kinds of image instance bag, namely segmented instance bag (SIB) and layered instance bag (LIB), are designed to represent the scene image. Third, a Mahalanobis distance-based K-Medoids approach is applied to cluster SIB and LIB, respectively, to convert the multi-instance into single-instance, and then the obtained two single-instances are concatenated to generate more powerful scene-aware representation. At last, a multi-class classification technique is used to make predictions on the class labels. Experiments are performed on real remote sensing images and the results show that the proposed method is valid and can achieve superior performance to a number of state-of-the-art approaches.
Highlights
With the continuous advances in sensor technology, a great number of high spatial resolution (HSR) remote sensing (RS) images can be available
This paper focuses on the multi-label classification problem for remote sensing scene images
We have proposed a novel framework based on multi-bag integration for the problem
Summary
With the continuous advances in sensor technology, a great number of high spatial resolution (HSR) remote sensing (RS) images can be available. These HSR images contain abundant spatial and structural information, making the perspective of traditional remote sensing image understanding change. Multi-label classification framework that can assign multiple labels to complex scenes becomes crucial for effective and comprehensive HSR remote sensing image annotation [1]–[3]. N }, the goal of MLL is to learn a function f : X → 2Y which maps an instance xi ∈ X produced by an input image to a set of labels Yi = y1i , y2i , .
Published Version (Free)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have