Abstract
Noisy labeled data are a rich source of information that is often easily accessible and cheap to obtain, but label noise can have many negative consequences if not accounted for. How to fully utilize noisy labels has been studied extensively within the framework of standard supervised machine learning over several decades. However, very little research has been conducted on the challenge posed by noisy labels in non-standard settings, including situations where only a fraction of the samples are labeled (semi-supervised) and each high-dimensional sample is associated with multiple labels. In this work, we present a novel semi-supervised, multi-label dimensionality reduction method that effectively utilizes information from both noisy multi-labels and unlabeled data. With the proposed noisy multi-label semi-supervised dimensionality reduction (NMLSDR) method, the noisy multi-labels are denoised and the unlabeled data are labeled simultaneously via a specially designed label propagation algorithm. NMLSDR then learns a projection matrix for reducing the dimensionality by maximizing the dependence between the enlarged and denoised multi-label space and the features in the projected space. Extensive experiments on synthetic data, benchmark datasets, and a real-world case study demonstrate the effectiveness of the proposed algorithm and show that it outperforms state-of-the-art multi-label feature extraction algorithms.
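The two stages described above can be illustrated with a minimal sketch: a generic graph-based label propagation step (not the paper's exact propagation rule), followed by an HSIC-style projection that maximizes the dependence between the propagated label space and the projected features. All function names and parameter choices here (`alpha`, `sigma`, the linear label kernel) are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def propagate_labels(X, Y, alpha=0.9, n_iter=100, sigma=1.0):
    """Generic graph-based label propagation (a sketch, not the exact
    NMLSDR rule). X: (n, d) features; Y: (n, c) multi-label indicator
    matrix with all-zero rows for unlabeled samples."""
    # Gaussian affinity matrix with zeroed diagonal.
    d2 = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    W = np.exp(-d2 / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    # Symmetric normalization: S = D^{-1/2} W D^{-1/2}.
    dinv = 1.0 / np.sqrt(np.maximum(W.sum(axis=1), 1e-12))
    S = dinv[:, None] * W * dinv[None, :]
    F = Y.astype(float).copy()
    for _ in range(n_iter):
        # Mix neighborhood evidence with the (noisy) initial labels,
        # which simultaneously denoises labels and labels unlabeled data.
        F = alpha * (S @ F) + (1.0 - alpha) * Y
    return F

def hsic_projection(X, F, k):
    """Projection maximizing an HSIC-style dependence between projected
    features and the propagated label space: top-k eigenvectors of
    X^T H K H X, with H the centering matrix and K = F F^T a linear
    label kernel."""
    n = X.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n
    K = F @ F.T                      # linear kernel on the label space
    M = X.T @ H @ K @ H @ X          # (d, d) dependence matrix
    _, vecs = np.linalg.eigh(M)      # eigenvalues in ascending order
    return vecs[:, -k:]              # top-k projection directions

# Toy usage: two well-separated clusters, one labeled point per class.
X = np.array([[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
              [5.0, 5.0], [5.1, 5.0], [5.0, 5.1]])
Y = np.zeros((6, 2))
Y[0, 0] = 1.0   # only samples 0 and 3 carry labels
Y[3, 1] = 1.0
F = propagate_labels(X, Y)
P = hsic_projection(X, F, k=1)      # project 2-D features down to 1-D
Z = X @ P
```

The eigen-decomposition step mirrors other HSIC-based multi-label reducers (e.g. MDDM); the key difference claimed for NMLSDR is that the label matrix fed into it has first been enlarged and denoised by the propagation step.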
Highlights
Supervised machine learning crucially relies on the accuracy of the observed labels associated with the training samples [1,2,3,4,5,6,7,8,9,10]
It can be seen that the classes are better separated and more compact in the noisy multi-label semi-supervised dimensionality reduction (NMLSDR) embedding than in the semi-supervised multi-label dimensionality reduction (SSMLDR) embedding
In this paper we have introduced the NMLSDR method, a dimensionality reduction method for partially and noisily labeled multi-label data
Summary
Supervised machine learning crucially relies on the accuracy of the observed labels associated with the training samples [1,2,3,4,5,6,7,8,9,10]. Observed labels may be corrupted and hence do not necessarily coincide with the true class of the samples. Such inaccurate labels are referred to as noisy [2, 4, 11]. Noisy labels may result from the use of frameworks such as anchor learning [12, 13] or silver standard learning [14], which have received interest, for instance, in healthcare analytics [15, 16]. A review of various sources of label noise can be found in [2].