Learning Algorithms for Domain Adaptation

Manas A Pathak, Eric H Nyberg

doi:10.1007/978-3-642-05224-8_23

Abstract

A fundamental assumption for any machine learning task is that training and test instances are drawn from the same distribution and that a sufficiently large number of training instances is available. In many practical settings, this ideal assumption does not hold: labeled training instances are scarce and labeling them is costly. On the other hand, we might have access to plenty of labeled data from a different domain, which can provide useful information for the present domain. In this paper, we discuss adaptive learning techniques to address this specific problem: learning with little training data from the same distribution along with a large pool of data from a different distribution. An underlying theme of our work is to identify situations in which the auxiliary data is likely to help in training with the primary data. We propose two algorithms for the domain adaptation task: dataset reweighting and subset selection. We present a theoretical analysis of the behavior of the algorithms based on the concept of domain similarity, which we use to formulate error bounds for our algorithms. We also present an experimental evaluation of our techniques on data from a real-world question answering system.

Keywords: Domain Adaptation, Training Instance, Auxiliary Data, Hypothesis Space, Primary Dataset
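The abstract names dataset reweighting without giving its details, so the following is only a rough illustration of the general idea and not the authors' algorithm. It assumes a nearest-centroid classifier and a single hypothetical parameter, `aux_weight`, that scales the influence of every auxiliary instance relative to a primary instance; all function names are invented for this sketch.

```python
from typing import Dict, List, Sequence, Tuple

# (feature vector, class label); this representation is an assumption of the sketch.
Instance = Tuple[Sequence[float], int]


def fit_weighted_centroids(
    data: List[Instance], weights: List[float]
) -> Dict[int, List[float]]:
    """Return one weighted-mean centroid per class label."""
    sums: Dict[int, List[float]] = {}
    totals: Dict[int, float] = {}
    for (x, label), w in zip(data, weights):
        if w <= 0:
            continue  # a non-positive weight drops the instance entirely
        acc = sums.setdefault(label, [0.0] * len(x))
        for i, value in enumerate(x):
            acc[i] += w * value
        totals[label] = totals.get(label, 0.0) + w
    return {label: [s / totals[label] for s in acc] for label, acc in sums.items()}


def predict(centroids: Dict[int, List[float]], x: Sequence[float]) -> int:
    """Assign x to the class whose centroid is closest in squared Euclidean distance."""
    def sq_dist(c: List[float]) -> float:
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda label: sq_dist(centroids[label]))


def train_reweighted(
    primary: List[Instance], auxiliary: List[Instance], aux_weight: float
) -> Dict[int, List[float]]:
    """Train on the pooled data: primary instances weigh 1.0, auxiliary ones aux_weight."""
    data = list(primary) + list(auxiliary)
    weights = [1.0] * len(primary) + [aux_weight] * len(auxiliary)
    return fit_weighted_centroids(data, weights)
```

Setting `aux_weight` to 0 ignores the auxiliary data and setting it to 1 pools both datasets as if they came from one distribution. In practice one might choose an intermediate value by checking error on held-out primary instances, which is one simple way to probe whether the auxiliary data helps.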

Similar Papers

Efficient data acquisition and training of collisional-radiative model artificial neural network surrogates through adaptive parameter space sampling
Nathan A Garland ... Prasanna Balaprakash
Machine Learning: Science and Technology | VOL. 3
10 Oct 2022

A Sensors Based Deep Learning Model for Unseen Locomotion Mode Identification using Multiple Semantic Matrices
Rahul Mishra ... Tanima Dutta
IEEE Transactions on Mobile Computing | VOL. 21
01 Mar 2022

Improving SVM accuracy by training on auxiliary data sources
Pengcheng Wu ... Thomas G Dietterich
01 Jan 2004

A Transfer-Learning Approach to Image Segmentation Across Scanners by Maximizing Distribution Similarity
Annegreet Van Opbroek ... Meike W Vernooij
01 Jan 2013
