Abstract

We investigate the use of Minimax distances to extract, in a nonparametric way, the features that capture the unknown underlying patterns and structures in the data. We develop a general-purpose and computationally efficient framework to employ Minimax distances with many machine learning methods that operate on numerical data. We study both computing the pairwise Minimax distances for all pairs of objects and computing the Minimax distances of all the objects to/from a fixed (test) object. We first efficiently compute the pairwise Minimax distances between the objects, using the equivalence of Minimax distances over a graph and over a minimum spanning tree constructed on it. Then, we perform an embedding of the pairwise Minimax distances into a new vector space, such that their squared Euclidean distances in the new space equal the pairwise Minimax distances in the original space. We also study the case of having multiple pairwise Minimax matrices, instead of a single one. For this setting, we propose an embedding that first sums up the centered matrices and then performs an eigenvalue decomposition to obtain the relevant features. Next, we study computing Minimax distances from a fixed (test) object, which can be used for instance in K-nearest neighbor search. Similar to the case of all-pairs Minimax distances, we develop an efficient and general-purpose algorithm that is applicable with any arbitrary base distance measure. Moreover, we investigate in detail the edges selected by the Minimax distances and thereby explore the ability of Minimax distances to detect outlier objects. Finally, for each setting, we perform several experiments to demonstrate the effectiveness of our framework.
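The MST equivalence mentioned in the abstract can be illustrated with a short sketch: build a minimum spanning tree on the base (here Euclidean) distances, then take, for each pair of objects, the largest edge weight on the unique tree path between them. The function name and the use of SciPy are our own illustrative choices, not part of the paper; the traversal below is a simple O(n^2)-per-source walk rather than an optimized implementation.

```python
import numpy as np
from scipy.sparse.csgraph import minimum_spanning_tree
from scipy.spatial.distance import pdist, squareform

def minimax_distances(X):
    """Pairwise Minimax distances via the MST equivalence:
    the Minimax distance between i and j equals the largest
    edge weight on the unique MST path connecting them."""
    D = squareform(pdist(X))              # base (Euclidean) distances
    T = minimum_spanning_tree(D).toarray()
    T = np.maximum(T, T.T)                # symmetrize the tree adjacency
    n = len(X)
    M = np.zeros((n, n))
    # From each source, propagate the max edge weight seen so far
    # along the tree paths (depth-first traversal of the MST).
    for s in range(n):
        visited = {s}
        stack = [s]
        while stack:
            u = stack.pop()
            for v in range(n):
                if T[u, v] > 0 and v not in visited:
                    M[s, v] = max(M[s, u], T[u, v])
                    visited.add(v)
                    stack.append(v)
    return M
```

For example, for the one-dimensional points 0, 1, 10, 11, the Minimax distance between 0 and 11 is 9: the largest gap that any path between them must cross.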

Highlights

  • Data is usually described by a set of objects and a corresponding representation

  • We developed a framework to apply Minimax distances to any learning algorithm that works on numerical data, with attention to both generality and efficiency

  • We studied both computing the pairwise Minimax distances for all pairs of objects and computing the Minimax distances of all the objects to/from a fixed object
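The embedding step described in the abstract (squared Euclidean distances in the new space equal the Minimax distances) can be sketched as classical-MDS-style double centering followed by an eigenvalue decomposition. The function name is ours; we assume the input is a valid Minimax (ultrametric) matrix, so that the centered matrix is positive semidefinite. For the multiple-matrix setting, the abstract sums the centered matrices before the decomposition.

```python
import numpy as np

def minimax_embedding(M, tol=1e-10):
    """Embed a pairwise Minimax matrix M into a vector space so that
    squared Euclidean distances between the returned rows reproduce M."""
    n = M.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n   # double-centering matrix
    B = -0.5 * J @ M @ J                  # Gram (inner-product) matrix
    vals, vecs = np.linalg.eigh(B)        # symmetric eigendecomposition
    keep = vals > tol                     # keep strictly positive modes
    return vecs[:, keep] * np.sqrt(vals[keep])
```

The returned coordinates can then be fed to any learning method that expects numerical vectors, which is the point of the framework.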


Introduction

Data is usually described by a set of objects and a corresponding representation. The basic representation, e.g., Euclidean distance, Mahalanobis distance, cosine similarity or Pearson correlation, might fail to correctly capture the underlying patterns or classes. Kernel methods are a common approach to enrich the basic representation of the data and model the underlying patterns (Shawe-Taylor and Cristianini 2004; Hofmann et al. 2008). The applicability of kernels is confined by several limitations: (1) finding the optimal parameter(s) of a kernel function is often very critical and nontrivial (Nadler and Galun 2007; Luxburg 2007), and (2) as we will see later, kernels assume a global structure which does not distinguish between the different types of classes in the data.
