Semi-supervised oblique predictive clustering trees.

Tomaž Stepišnik,Dragi Kocev

doi:10.7717/peerj-cs.506

Tomaž Stepišnik, Dragi Kocev

Open Access

https://doi.org/10.7717/peerj-cs.506

Copy DOI

Abstract

Semi-supervised learning combines supervised and unsupervised learning approaches to learn predictive models from both labeled and unlabeled data. It is most appropriate for problems where labeled examples are difficult to obtain but unlabeled examples are readily available (e.g., drug repurposing). Semi-supervised predictive clustering trees (SSL-PCTs) are a prominent method for semi-supervised learning that achieves good performance on various predictive modeling tasks, including structured output prediction tasks. The main issue, however, is that the learning time scales quadratically with the number of features. In contrast to axis-parallel trees, which only use individual features to split the data, oblique predictive clustering trees (SPYCTs) use linear combinations of features. This makes the splits more flexible and expressive and often leads to better predictive performance. With a carefully designed criterion function, we can use efficient optimization techniques to learn oblique splits. In this paper, we propose semi-supervised oblique predictive clustering trees (SSL-SPYCTs). We adjust the split learning to take unlabeled examples into account while remaining efficient. The main advantage over SSL-PCTs is that the proposed method scales linearly with the number of features. The experimental evaluation confirms the theoretical computational advantage and shows that SSL-SPYCTs often outperform SSL-PCTs and supervised PCTs both in single-tree setting and ensemble settings. We also show that SSL-SPYCTs are better at producing meaningful feature importance scores than supervised SPYCTs when the amount of labeled data is limited.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: PeerJ. Computer science	Publication Date: May 3, 2021
Citations: 3	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Semi-supervised oblique predictive clustering trees.

Abstract

Talk to us

Similar Papers

More From: PeerJ. Computer science

Lead the way for us

Similar Papers

Semi-Supervised Predictive Clustering Trees for (Hierarchical) Multi-Label Classification
Jurica Levatić ... Michelangelo Ceci
International Journal of Intelligent Systems | VOL. 2024
Jurica Levatić, et. al.Jurica Levatić ... Michelangelo Ceci
13 Apr 2024
International Journal of Intelligent Systems | VOL. 2024

Semi-supervised trees for multi-target regression
Jurica Levatić ... Sašo Džeroski
Information Sciences | VOL. 450
Jurica Levatić, et. al.Jurica Levatić ... Sašo Džeroski
12 Mar 2018
Information Sciences | VOL. 450

Exploiting partially-labeled data in learning predictive clustering trees for multi-target regression: A case study of water quality assessment in Ireland
Stevanche Nikoloski ... Sašo Džeroski
Ecological Informatics | VOL. 61
Stevanche Nikoloski, et. al.Stevanche Nikoloski ... Sašo Džeroski
01 Oct 2020
Ecological Informatics | VOL. 61

Adaptive Consistency Regularization for Semi-Supervised Transfer Learning
Abulikemu Abuduweili ... Humphrey Shi
-
Abulikemu Abuduweili, et. al.Abulikemu Abuduweili ... Humphrey Shi
01 Jun 2021
01 Jun 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Semi-supervised oblique predictive clustering trees.

Abstract

Talk to us

Similar Papers

More From: PeerJ. Computer science