Missing values imputation in Arabic datasets using enhanced robust association rules

Awsan Salem, Azah Kamilah Muda, Zahriah Sahri, Abdulrazzak Ali, Nurul Akmar Emran

doi:10.11591/ijeecs.v28.i2.pp1067-1075

Abstract

Missing values (MVs) are one form of data completeness problem in massive datasets. Data imputation methods have been proposed to deal with missing values and so improve the completeness of the datasets concerned. Imputation accuracy is a common indicator of how effective a data imputation technique is. However, that effectiveness can be affected by the nature of the language in which the dataset is written. To overcome this problem, the data must be normalized, especially for languages written in non-Latin scripts such as Arabic. This paper proposes a method that addresses the challenges inherent in Arabic datasets by extending the enhanced robust association rules (ERAR) method with Arabic detection and correction functions. In an experiment, the proposed method was compared against the Iterative and Decision Tree imputation methods. The results show that the proposed method achieves higher imputation accuracy than both.
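The abstract does not spell out the Arabic detection and correction functions, so the following is only a minimal sketch of what normalizing Arabic values before imputation might look like. The function name `normalize_arabic`, the choice of characters to strip, and the variant-to-canonical letter mapping are all assumptions made for illustration, not the authors' method. The point it illustrates is that orthographic variants of the same word would otherwise be counted as different values when association rules are mined.

```python
import re
import unicodedata
from collections import Counter

# Assumption: strip short-vowel marks (U+064B..U+0652), superscript alef
# (U+0670) and tatweel (U+0640); the paper does not list its own rules.
_MARKS = re.compile(r"[\u064B-\u0652\u0670\u0640]")

# Assumption: fold common spelling variants onto one canonical letter.
_VARIANTS = str.maketrans({
    "\u0623": "\u0627",  # alef with hamza above -> bare alef
    "\u0625": "\u0627",  # alef with hamza below -> bare alef
    "\u0622": "\u0627",  # alef with madda       -> bare alef
    "\u0649": "\u064A",  # alef maqsura          -> yeh
    "\u0629": "\u0647",  # teh marbuta           -> heh
})


def normalize_arabic(text: str) -> str:
    """Return a canonical form of an Arabic string (hypothetical helper)."""
    text = unicodedata.normalize("NFKC", text)  # fold presentation forms
    text = _MARKS.sub("", text)                 # drop diacritics and tatweel
    text = text.translate(_VARIANTS)            # unify letter variants
    return " ".join(text.split())               # collapse whitespace


# Three spellings of the same name: hamza on the alef, bare alef,
# and bare alef with a fatha on the third letter.
raw_values = [
    "\u0623\u062D\u0645\u062F",
    "\u0627\u062D\u0645\u062F",
    "\u0627\u062D\u0645\u064E\u062F",
]

raw_counts = Counter(raw_values)
clean_counts = Counter(normalize_arabic(v) for v in raw_values)
```

Before normalization the three spellings are three distinct values with a count of one each; afterwards they collapse into a single value with a count of three, which is the kind of consistency a frequency-based rule miner depends on. Folding teh marbuta and alef maqsura is lossy and may not suit every dataset.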

Journal: Indonesian Journal of Electrical Engineering and Computer Science
Publication Date: Nov 1, 2022
License type: cc-by-nc

Similar Papers

Software Implementation of Missing Data Recovery: Comparative Analysis
N V Kovtun ... A.-N Ya Fataliieva
Statistics of Ukraine | VOL. 91
16 Dec 2020

Missing Data Imputation Through SGTM Neural-Like Structure for Environmental Monitoring Tasks
Oleksandra Mishchuk ... Ivan Izonin
29 Mar 2019

A Pattern-Recognition-Based Ensemble Data Imputation Framework for Sensors from Building Energy Systems
Liang Zhang
Sensors | VOL. 20
21 Oct 2020

Strategies for handling missing clinical data for automated surgical site infection detection from the electronic health record
Zhen Hu ... Gyorgy J Simon
Journal of Biomedical Informatics | VOL. 68
16 Mar 2017
