Comparison of two topological approaches for dealing with noisy labeling

Fabien Rico,Fabrice Muhlenbach,Djamel A Zighed,Stéphane Lallich

doi:10.1016/j.neucom.2014.10.087

Comparison of two topological approaches for dealing with noisy labeling

Fabien Rico, Fabrice Muhlenbach + Show 2 more

Open Access

https://doi.org/10.1016/j.neucom.2014.10.087

Copy DOI

Journal: Neurocomputing	Publication Date: Feb 10, 2015
Citations: 1	License type: other-oa

Affiliation: Claude Bernard University Lyon 1, Laboratoire Hubert Curien, French National Centre for Scientific Research, Lumière University Lyon 2

#Neighborhood Graph #Mislabeled Instances + Show 8 more

Abstract
Full-Text PDF
Similar Papers

Abstract

This paper focuses on the detection of likely mislabeled instances in a learning dataset. In order to detect potentially mislabeled samples, two solutions are considered which are both based on the same framework of topological graphs. The first is a statistical approach based on Cut Edges Weighted statistics (CEW) in the neighborhood graph. The second solution is a Relaxation Technique (RT) that optimizes a local criterion in the neighborhood graph. The evaluations by ROC curves show good results since almost 90% of the mislabeled instances are retrieved for a cost of less than 20% of false positive. The removal of samples detected as mislabeled by our approaches generally leads to an improvement of the performances of classical machine learning algorithms.

Full Text