Abstract

Multi-instance (MI) learning is a branch of machine learning in which each object (bag) consists of multiple feature vectors (instances); for example, an image can be represented by multiple patches and their corresponding feature vectors. In MI classification, each bag in the training set has a class label, but the instances are unlabeled. The instances are most commonly regarded as a set of points in a multi-dimensional space. Alternatively, instances can be viewed as realizations of random vectors with a corresponding probability distribution, where the bag is identified with the distribution rather than with its realizations. By introducing the probability distribution space to bag-level classification problems, dissimilarities between probability distributions (divergences) can be applied. The bag-to-bag Kullback–Leibler (KL) information is asymptotically the best classifier, but the typical sparseness of MI training sets is an obstacle. We introduce bag-to-class divergence to MI learning, emphasizing the hierarchical nature of the random vectors, which makes bags from the same class differ from each other. We propose two properties for bag-to-class divergences, plus an additional property for sparse training sets, and present a dissimilarity measure that fulfils them. Its performance is demonstrated on synthetic and real data. The probability distribution space is a valid framework for MI learning, both for theoretical analysis and for applications.
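The paper itself contains no code, but the core idea of the abstract can be sketched: estimate a density for the unlabeled bag and one per class, then compare them with a divergence. The minimal Python sketch below is illustrative only; the function names `kl_divergence_mc` and `bag_to_class_scores` are hypothetical, and Gaussian kernel density estimation with Monte Carlo integration stands in for whatever estimators the paper actually uses.

```python
import numpy as np
from scipy.stats import gaussian_kde

def kl_divergence_mc(p_kde, q_kde, n_samples=5000, eps=1e-12):
    """Monte Carlo estimate of KL(p || q) = E_p[log p(X) - log q(X)]."""
    samples = p_kde.resample(n_samples)      # draw from p; shape (n_features, n_samples)
    log_p = np.log(p_kde(samples) + eps)
    log_q = np.log(q_kde(samples) + eps)
    return float(np.mean(log_p - log_q))

def bag_to_class_scores(bag, class_bags):
    """Divergence from one unlabeled bag to each class-level distribution.

    bag        : (n_instances, n_features) array of instance vectors.
    class_bags : dict {label: list of (n_i, n_features) training bags};
                 each class's instances are pooled into one density.
    """
    p_bag = gaussian_kde(bag.T)              # gaussian_kde expects (n_features, n_points)
    scores = {}
    for label, bags in class_bags.items():
        pooled = np.vstack(bags)             # aggregate instances according to class label
        p_class = gaussian_kde(pooled.T)
        scores[label] = kl_divergence_mc(p_bag, p_class)
    return scores
```

A new bag would then be assigned to the class with the smallest bag-to-class divergence.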

Highlights

  • We argue that the equality, orthogonality and monotonicity properties possessed by f-divergences are reasonable requirements for bag-to-class divergences: the situations they describe are valid for uncertain objects and, although less likely in general practice, can occur with sparse class sampling

  • The bag-to-bag KL information has the minimum misclassification rate, but the typical bag sparseness of MI training sets is an obstacle. This is partly solved by bag-to-class dissimilarities, and the proposed class-conditional KL information accounts for the additional sparsity of bags

  • Aggregation of instances according to bag label, together with the additional class-conditioning, provides a solution to the bag sparsity problem, as illustrated in the toy example after this list
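As a toy illustration of the aggregation step, the following usage example (assuming the hypothetical `bag_to_class_scores` sketch above, with synthetic Gaussian data) pools the instances of all bags sharing a label into one class-level density and classifies a new bag by the smallest bag-to-class divergence. It demonstrates the pooling principle only; it is not the paper's proposed class-conditional measure.

```python
import numpy as np

rng = np.random.default_rng(0)

def make_bag(shift, n_instances=25):
    # 2-D instances drawn from a Gaussian centred at `shift`
    return rng.normal(loc=shift, scale=1.0, size=(n_instances, 2))

# Five training bags per class; each class-level density pools all of them,
# which mitigates the sparseness of individual bags.
class_bags = {
    "positive": [make_bag(1.5) for _ in range(5)],
    "negative": [make_bag(0.0) for _ in range(5)],
}

new_bag = make_bag(1.5)                      # held-out bag from the positive class
scores = bag_to_class_scores(new_bag, class_bags)
predicted = min(scores, key=scores.get)      # smallest divergence wins
print(scores, "->", predicted)
```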

Introduction

Machine-learning applications include a wide variety of data types, and images are one of the most successful areas. Machine learning has had an enormous impact on image analysis, especially by replacing small sets of hand-crafted features with large sets of computer-readable features, which often lack apparent interpretation. The training data consists of K objects, x, with corresponding class labels, y: {(x1, y1), ..., (xK, yK)}. The task is to build a classifier that correctly labels a new object. The training data is used to adjust the model according to the desired outcome, often by maximizing the accuracy of the classifier.
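To make the contrast with standard supervised data concrete, here is a minimal sketch of an MI training set; the `Bag` container and its field names are illustrative, not taken from the paper.

```python
from dataclasses import dataclass
import numpy as np

@dataclass
class Bag:
    instances: np.ndarray  # (n_instances, n_features); the instances themselves are unlabeled
    label: int             # the class label is attached to the bag as a whole

# Standard supervised data pairs one feature vector with one label: {(x1, y1), ..., (xK, yK)}.
# In MI learning, each labeled object is instead a collection of feature vectors,
# and bags may contain different numbers of instances.
training_set = [
    Bag(instances=np.random.default_rng(1).normal(size=(30, 8)), label=0),
    Bag(instances=np.random.default_rng(2).normal(size=(12, 8)), label=1),
]
```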
