Standardization with zlog values improves exploratory data analysis and machine learning for laboratory data

Amani Al-Mekhlafi, Sandra Klawitter, Frank Klawonn

doi:10.1515/labmed-2024-0051

Abstract

Objectives: In exploratory data analysis and machine learning, standardization of laboratory results is an important pre-processing step. Routine datasets contain variable proportions of pathological results, which shift the mean (µ) and standard deviation (σ) and thus cause problems for the classical z-score transformation. This study therefore investigates whether the zlog transformation compensates for these disadvantages and makes the results more meaningful from a medical perspective.

Methods: The results presented here were obtained with the statistical software environment R, and the underlying data set was obtained from the UC Irvine Machine Learning Repository. We compare the zlog and z-score transformations for five different dimension reduction methods, hierarchical clustering and four supervised classification methods.

Results: In this study, the zlog transformation gave better results than the z-score transformation for the dimension reduction, clustering and classification methods. By compensating for the disadvantages of the z-score transformation, the zlog transformation allows more meaningful medical conclusions.

Conclusions: We recommend using the zlog transformation of laboratory results for pre-processing when exploratory data analysis and machine learning techniques are applied.
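To make the contrast concrete, the following is a minimal Python sketch of the two transformations. It is not the authors' R code. The zlog formula used here is the commonly published definition based on the lower and upper reference limits of an analyte, which the abstract itself does not spell out, so treat it as an assumption. The function names (`z_score`, `zlog`) and all numeric values, including the reference limits 10 and 40, are hypothetical and chosen only for illustration.

```python
import math
from statistics import mean, stdev


def z_score(values):
    """Classical z-score: depends on the mean and SD of the dataset itself."""
    mu = mean(values)
    sigma = stdev(values)  # sample standard deviation
    return [(v - mu) / sigma for v in values]


def zlog(value, lower, upper):
    """zlog value of one result, given the analyte's reference limits.

    Assumed definition: the log-transformed reference interval is taken to
    span mean +/- 1.96 SD, so the mean and SD come from the reference limits
    instead of from the dataset.
    """
    if value <= 0 or lower <= 0 or upper <= lower:
        raise ValueError("need value > 0 and 0 < lower < upper")
    log_lower, log_upper = math.log(lower), math.log(upper)
    mu_log = (log_lower + log_upper) / 2
    sigma_log = (log_upper - log_lower) / 3.92
    return (math.log(value) - mu_log) / sigma_log


# Hypothetical results for one analyte with reference interval 10 to 40.
mostly_healthy = [15.0, 20.0, 25.0]
with_pathological = [15.0, 20.0, 25.0, 200.0, 300.0]

# The z-score of the same result (20.0) changes with the cohort composition.
z_in_healthy = z_score(mostly_healthy)[1]
z_in_mixed = z_score(with_pathological)[1]

# The zlog value of 20.0 depends only on the reference limits.
zlog_value = zlog(20.0, lower=10.0, upper=40.0)

print(z_in_healthy, z_in_mixed, zlog_value)
```

Under this definition, a result at the lower or upper reference limit maps to -1.96 or +1.96 regardless of how many pathological results the dataset contains, which is the property the abstract relies on.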


Journal: Journal of Laboratory Medicine
Publication Date: Jun 27, 2024
Citations: 1
License type: CC BY 4.0
