Class imbalance in gradient boosting classification algorithms: Application to experimental stroke data.

Olga Lyashevska, Jan-Hendrik Buhk, Liam Morris, Jens Fiehler, Fiona Malone, Eugene Maccarthy

doi:10.1177/0962280220980484

Abstract

Imbalance between positive and negative outcomes, so-called class imbalance, is a problem commonly found in medical data. Imbalanced data hinder the performance of conventional classification methods, which aim to improve the overall accuracy of the model without accounting for the uneven distribution of the classes. To rectify this, the data can be resampled by oversampling the positive (minority) class until the classes are approximately equally represented. After that, a prediction model such as a gradient boosting algorithm can be fitted with greater confidence. This classification method allows for non-linear relationships and deep interaction effects while focusing on difficult areas by iteratively shifting towards problematic observations. In this study, we demonstrate the application of these methods to medical data and develop a practical framework for evaluating the features that contribute to the probability of stroke.
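The resampling step described in the abstract can be illustrated with a minimal sketch of random oversampling. This is not the authors' implementation: the function name `oversample_minority`, the toy data, and the choice of sampling with replacement up to exact balance are assumptions made for illustration only.

```python
import random
from collections import Counter


def oversample_minority(rows, labels, seed=0):
    """Randomly duplicate minority-class rows until both classes have equal counts.

    Hypothetical helper for illustration; assumes a binary outcome.
    """
    rng = random.Random(seed)
    counts = Counter(labels)
    if len(counts) != 2:
        raise ValueError("expected exactly two classes")
    # most_common returns (label, count) pairs sorted by count, largest first.
    (majority, n_major), (minority, n_minor) = counts.most_common(2)
    minority_idx = [i for i, y in enumerate(labels) if y == minority]
    # Draw minority indices with replacement to close the gap between classes.
    extra = [rng.choice(minority_idx) for _ in range(n_major - n_minor)]
    new_rows = list(rows) + [rows[i] for i in extra]
    new_labels = list(labels) + [labels[i] for i in extra]
    return new_rows, new_labels


# Toy data (assumed): 8 negative outcomes (0) and 2 positive outcomes (1).
X = [[float(i)] for i in range(10)]
y = [0] * 8 + [1] * 2

X_res, y_res = oversample_minority(X, y)
balanced_counts = Counter(y_res)
print(balanced_counts[0], balanced_counts[1])
```

The balanced rows and labels could then be passed to a gradient boosting classifier. In practice, oversampling is usually applied to the training split only, so that duplicated observations do not leak into the data used for evaluation.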

Journal: Statistical Methods in Medical Research
Publication Date: Dec 28, 2020
Citations: 10
