How and when to stop the co-training process

Edita Grolman,Dvir Cohen,Tatiana Frenklach,Asaf Shabtai,Rami Puzis

doi:10.1016/j.eswa.2021.115841

Abstract

Co-training is a semi-supervised learning approach used when only a small set of the data that is available for training is labeled. By using multiple classifiers, the co-training process utilizes the small set of labeled data in order to label an additional set of samples. During this process, the classifiers gradually augment the training data in an iterative process in which a new co-training model is derived and used for labeling the unlabeled samples in each iteration. A few of the newly labeled samples are added in each iteration to the training dataset to improve the performance of the classifiers. The main challenge in applying co-training is to make sure that the co-trainer assigns accurate labels to the unlabeled samples. Many empirical studies showed that the performance (accuracy) of the co-trainer could not be further improved when a certain number of iterations was reached, and in some cases, the performance even declined if the process (i.e., labeling) continued. Despite this, no general solution has been suggested for identifying the optimal final co-training model or number of iterations before this decline. In this work, we propose a novel method aimed at selecting the near-optimal final co-training model among all models created in the various iterations according to a predefined measurement based solely on the unlabeled data. Experiments on nine open, publicly available and real-life datasets demonstrate that the proposed method outputs a near-optimal final co-training model compared to other co-training models created in the various iterations.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

How and when to stop the co-training process

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications

Lead the way for us

Journal: Expert Systems with Applications	Publication Date: Sep 17, 2021
Citations: 2

Similar Papers

A Semi-Supervised Deep Learning Approach for the Classification of Steel Surface Defects
Mathuranathan Mayuravaani ... Siyamalan Manivannan
-
Mathuranathan Mayuravaani, et. al.Mathuranathan Mayuravaani ... Siyamalan Manivannan
11 Aug 2021
11 Aug 2021

On semi-supervised linear regression in covariate shift problems
...
Journal of Machine Learning Research | VOL. 16
, et. al. ...
01 Jan 2015
Journal of Machine Learning Research | VOL. 16

A semi-supervised deep learning approach for cropped image detection
Israr Hussain ... Jiwu Huang
Expert Systems with Applications | VOL. 243
Israr Hussain, et. al.Israr Hussain ... Jiwu Huang
12 Dec 2023
Expert Systems with Applications | VOL. 243

An Effective Tumor Classification With Deep Forest and Self-Training
Zhanbo Chen ... Lili Shen
IEEE Access | VOL. 9
Zhanbo Chen, et. al.Zhanbo Chen ... Lili Shen
01 Jan 2020
IEEE Access | VOL. 9

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

How and when to stop the co-training process

Abstract

Talk to us

Similar Papers

More From: Expert Systems with Applications