An Efficient Approach to Select Instances in Self-Training and Co-Training Semi-Supervised Methods

Karliane Medeiros Ovidio Vale,Arthur Costa Gorgonio,Flavius Da Luz E Gorgonio,Anne Magaly De Paula Canuto

doi:10.1109/access.2021.3138682

Karliane Medeiros Ovidio Vale, Arthur Costa Gorgonio + Show 2 more

Open Access

https://doi.org/10.1109/access.2021.3138682

Copy DOI

Journal: IEEE Access	Publication Date: Jan 1, 2022
Citations: 12	License type: CC BY 4.0

Affiliation: Universidade Federal do Rio Grande do Norte

Abstract

Semi-supervised learning is a machine learning approach that integrates supervised and unsupervised learning mechanisms. In this learning, most of labels in the training set are unknown, while there is a small part of data that has known labels. The semi-supervised learning is attractive due to its potential to use labeled and unlabeled data to perform better than supervised learning. This paper consists of a study in the field of semi-supervised learning and implements changes on two well-known semi-supervised learning algorithms: self-training and co-training. In the literature, it is common to develop researches that change the structure of these algorithms, however, none of them proposes automating the labeling process of unlabeled instances, which is the main purpose of this work. In order to achieve this goal, three methods are proposed: FlexCon-G, FlexCon and FlexCon-C. The main difference among these methods is the way in which the confidence rate is calculated and the strategy used to select a label in each iteration. In order to evaluate the proposed methods’ performance, an empirical analysis is conducted, in which the performance of these methods has been evaluated on 30 datasets with different characteristics. The obtained results indicate that all three proposed methods perform better than the original self-training and co-training methods, in most analysed cases.

Highlights

T HE technological progress in recent years has greatly promoted the availability of large amounts of data
Depending on the degree of supervision used during the training phase, Machine Learning (ML) techniques can be divided into three categories: supervised, unsupervised and semi-supervised
The ML algorithms learn from past experience and from the implicit knowledge present in existing datasets; what distinguishes them is the fact that the data which such algorithms use have information that may or may not be labeled

Summary

Introduction

T HE technological progress in recent years has greatly promoted the availability of large amounts of data. Depending on the degree of supervision used during the training phase, Machine Learning (ML) techniques can be divided into three categories: supervised, unsupervised and semi-supervised In these three types, the ML algorithms learn from past experience and from the implicit knowledge present in existing datasets; what distinguishes them is the fact that the data which such algorithms use have information that may or may not be labeled. Semi-supervised learning makes it possible to train classifiers with a small amount of labeled data and a large amount of unlabeled data [4] This last mentioned approach for learning has become widely used in recent years [5]–[11]

Objectives

Methods

Results

Full Text

Paper version not known

Open DOI Link

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

An Efficient Approach to Select Instances in Self-Training and Co-Training Semi-Supervised Methods

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access

Lead the way for us

Similar Papers

Applying Efficient Selection Techniques of Unlabeled Instances for Wrapper-Based Semi-Supervised Methods
Cephas A S Barreto ... Joao C Xavier-Junior
IEEE Access | VOL. 10
Cephas A S Barreto, et. al.Cephas A S Barreto ... Joao C Xavier-Junior
01 Jan 2021
IEEE Access | VOL. 10

A Distance-Weighted Selection of Unlabelled Instances for Self-training and Co-training Semi-supervised Methods
Cephas A S Barreto ... Anne M P Canuto
-
Cephas A S Barreto, et. al.Cephas A S Barreto ... Anne M P Canuto
01 Jan 2020
01 Jan 2020

A Distance-weighted Selection of Unlabelled Instances for Self-training and Co-training Semi-supervised Methods

-

15 Oct 2020
15 Oct 2020

Estimate Unlabeled-Data-Distribution for Semi-supervised PU Learning
Haoji Hu ... Chaofeng Sha
-
Haoji Hu, et. al.Haoji Hu ... Chaofeng Sha
01 Jan 2012
01 Jan 2012

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Efficient Approach to Select Instances in Self-Training and Co-Training Semi-Supervised Methods

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: IEEE Access