Abstract

Abstract. Domain adaptation techniques in transfer learning try to reduce the amount of training data required for classification by adapting a classifier trained on samples from a source domain to a new data set (target domain) where the features may have different distributions. In this paper, we propose a new technique for domain adaptation based on logistic regression. Starting with a classifier trained on training data from the source domain, we iteratively include target domain samples for which class labels have been obtained from the current state of the classifier, while at the same time removing source domain samples. In each iteration the classifier is re-trained, so that the decision boundaries are slowly transferred to the distribution of the target features. To make the transfer procedure more robust we introduce weights as a function of distance from the decision boundary and a new way of regularisation. Our methodology is evaluated using a benchmark data set consisting of aerial images and digital surface models. The experimental results show that in the majority of cases our domain adaptation approach can lead to an improvement of the classification accuracy without additional training data, but also indicate remaining problems if the difference in the feature distributions becomes too large.

Highlights

  • Supervised classification of images and derived data for automatic information retrieval is an important topic in photogrammetry and and remote sensing

  • We address one specific setting of transductive transfer learning called domain adaptation (DA) in which the source and the target domains are supposed to differ by the marginal distributions of the features used in the classification process, e.g. (Bruzzone and Marconcini, 2009)

  • In this paper we expand this method so that it becomes more robust with respect to overlapping feature distributions, and we evaluate the new method using a subset of the ISPRS 2D semantic labelling challenge (Wegner et al, 2016)

Read more

Summary

Introduction

Supervised classification of images and derived data for automatic information retrieval is an important topic in photogrammetry and and remote sensing. Applying a trained classifier to another image than the one from which the training data were generated reduces the amount of manual labour, but this strategy is very likely to give suboptimal results This is due to the fact that in the new image the features may follow a different distribution than in the original one, so that the assumption of the training data being representative for the data to be classified is no longer fulfilled. There are different settings for the TL problem; in the context of the classification of remote sensing images we are mostly interested in the case where labelled training data are only available for the source domain, which is related to the transductive transfer learning paradigm

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call