A Comparison of Data Imputation Methods Utilizing Machine Learning for a New IoT System Platform

Sena Kalay,İnci Sarıçiçek,Eyüp Çinar

doi:10.1109/codit55151.2022.9804113

Abstract

IoT systems are being used widely place in manufacturing. The volume of thesensor data in these systems is significant. In real-life scenarios, missing sensor data can cause problems, especially for data-driven machine learning (ML) models. The gaps due to missing sensor data should be handled before employing machine learning models. The common practices are to remove the missing data completely or apply simple arithmetic operations. However, there are more sophisticated approaches in the literature that can be applied to these real-time IoT systems considering the native data characteristics. This study compares the performance of regression-based ML algorithms missing data imputation methods such as Support Vector Regression (SVR), Decision Tree Regression (DTR), Ridge Regression, K-Nearest Neighbors Regression (KNN), MissForest (MF), and XGBoost Regression (XGB). Missing data in different positions and proportions are created utilizing experimentally collected time-series sensor data from a newly developed IoT system platform. The initial work based on the ML models is presented on these datasets together with an overview of the IoT system architecture. The average RMSE and R <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">2</sup> values of the six ML models showed that the Ridge Regression outperforms the other ML models for the missing data imputation.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Comparison of Data Imputation Methods Utilizing Machine Learning for a New IoT System Platform

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Data Conditioning and Forecasting Methodology using Machine Learning on Production Data for a Well Pad
Maryam Bagheri ... Haoran Zhao
-
Maryam Bagheri, et. al.Maryam Bagheri ... Haoran Zhao
04 May 2020
04 May 2020

Application of Machine Learning Models to Bridge Afflux Estimation
Reza Piraei ... Andrea Menapace
Water | VOL. 15
Reza Piraei, et. al.Reza Piraei ... Andrea Menapace
10 Jun 2023
Water | VOL. 15

Machine learning approaches for formation matrix volume prediction from well logs: Insights and lessons learned
Pamidi Venkata Durga Kannaiah ... Neetish Kumar Maurya
Geoenergy Science and Engineering | VOL. 229
Pamidi Venkata Durga Kannaiah, et. al.Pamidi Venkata Durga Kannaiah ... Neetish Kumar Maurya
08 Jul 2023
Geoenergy Science and Engineering | VOL. 229

Three-level evaluation method of cumulative slope deformation hybrid machine learning models and interpretability analysis
Zhi-Xing Deng ... Xian-Pu Xiao
Construction and Building Materials | VOL. 408
Zhi-Xing Deng, et. al.Zhi-Xing Deng ... Xian-Pu Xiao
17 Oct 2023
Construction and Building Materials | VOL. 408

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Comparison of Data Imputation Methods Utilizing Machine Learning for a New IoT System Platform

Abstract

Talk to us

Similar Papers