Abstract

Active learning is an area of significant ongoing research interest for the classification of remotely sensed data, where obtaining efficient training data is both time consuming and expensive. The goal of active learning is to achieve high classification performance by querying as few samples as possible from a large unlabeled data pool. Traditional active learning frameworks all assume the existence of labeled samples for all classes of interest. However, in real-world applications, the unlabeled data pool may contain data from unknown classes that we are not aware of in advance, and a quick detection of them is useful for enriching our training set. In this scenario, traditional active learning methods may not effectively and rapidly detect the unknown classes. We proposed an active learning framework which provides robust classification performance with minimum manual labeling effort while simultaneously discovering unknown (missing) classes. The discovery of unknown classes is particularly suited to an active learning framework where an annotator is in the loop. A Dirichlet process mixture model is utilized in our proposed method to cluster the labeled and unlabeled samples as a whole. If unknown classes exist, they will emerge as new clusters which are different from other existing clusters occupied by known classes, and then, the proposed query strategy will give priority to querying samples in the new clusters. We present experimental results with hyperspectral data to show that our method provides better classification performance compared to existing active learning methods with or without unknown classes.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.