Comparison between Statistical Approaches and Data Mining Algorithms for Outlier Detection

Annisa Putri Utami, Khairil Anwar Notodiputro, Anwar Fitrianto

doi:10.18860/ca.v9i1.25450

Abstract

Outliers are observations that differ markedly from most of the other observations in a data set. Their presence can harm one analysis yet carry important information for another, so identifying outliers before analyzing the data is crucial. Outlier detection methods were pioneered by researchers in statistics. However, rapid technological advances have made it easy to collect large volumes of data, and the development of outlier detection techniques is now driven mainly by researchers in computer science (data mining), who rely on computing power. This research presents a simulation study that compares statistical approaches and data mining algorithms for identifying multiple outliers under various predetermined data scenarios. In the scenarios examined, the statistical outlier detection methods generally performed better than the data mining-based methods. For further research, we suggest improving the data mining methods by giving more attention to statistical analysis, in addition to computing time, so that outlier detection becomes both faster and more precise.
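The abstract does not name the specific methods or scenarios that were compared, so the following is only a minimal sketch of what such a simulation comparison could look like. It assumes a modified z-score rule (median and MAD) as a stand-in for a statistical approach and a k-th nearest neighbour distance score as a stand-in for a data mining approach. The function names, thresholds, sample size, and contamination scheme are all hypothetical choices made for illustration, not the authors' design.

```python
import random
import statistics


def mad_outliers(values, threshold=3.5):
    """Statistical stand-in: flag points whose modified z-score exceeds threshold."""
    med = statistics.median(values)
    mad = statistics.median([abs(v - med) for v in values])
    if mad == 0:
        return set()  # the score is undefined when the MAD is zero
    return {i for i, v in enumerate(values)
            if 0.6745 * abs(v - med) / mad > threshold}


def knn_outliers(values, k=5, top_m=10):
    """Data mining stand-in: flag the top_m points with the largest
    distance to their k-th nearest neighbour (brute force, O(n^2))."""
    scores = []
    for i, v in enumerate(values):
        dists = sorted(abs(v - w) for j, w in enumerate(values) if j != i)
        scores.append((dists[k - 1], i))
    scores.sort(reverse=True)
    return {i for _, i in scores[:top_m]}


def precision_recall(flagged, truth):
    """Share of flagged points that are true outliers, and share of true outliers found."""
    hits = len(flagged & truth)
    precision = hits / len(flagged) if flagged else 0.0
    recall = hits / len(truth) if truth else 0.0
    return precision, recall


def simulate(n=200, n_outliers=10, shift=8.0, seed=0):
    """Hypothetical scenario: standard normal data with a few shifted points."""
    rng = random.Random(seed)
    values = [rng.gauss(0.0, 1.0) for _ in range(n)]
    truth = set(rng.sample(range(n), n_outliers))
    for i in truth:
        values[i] += shift  # contaminate the chosen observations
    return values, truth


if __name__ == "__main__":
    values, truth = simulate()
    for name, flagged in [("modified z-score", mad_outliers(values)),
                          ("k-NN distance", knn_outliers(values))]:
        p, r = precision_recall(flagged, truth)
        print(f"{name}: precision={p:.2f} recall={r:.2f}")
```

A fuller study would repeat this over many seeds and scenarios (sample size, contamination rate, shift size) and average the precision and recall per method. Note that the k-NN sketch is told how many points to flag through `top_m`, which the statistical rule does not need, so the two are not compared on fully equal terms here.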

Journal: CAUCHY: Jurnal Matematika Murni dan Aplikasi
Publication Date: May 16, 2024
License type: CC BY-SA 4.0

Similar Papers

Generalised linear model-based algorithm for detection of outliers in environmental data and comparison with semi-parametric outlier detection methods
Martina Čampulová ... Jiří Moučka
Atmospheric Pollution Research | VOL. 10 | 11 Jan 2019

There and back again: Outlier detection between statistical reasoning and data mining algorithms
Arthur Zimek ... Peter Filzmoser
WIREs Data Mining and Knowledge Discovery | VOL. 8 | 20 Aug 2018

Outlier Detection for Sensor Data Streams Based on Maximum Frequent and Minimum Rare Patterns
Xiaochen Shi ... Saihua Cai
01 Jan 2020

Outlier Detection in Growth Data: Beyond Biologically Implausible Values
Catherine Birken ... Robert Bandsma
Current Developments in Nutrition | VOL. 4 | 29 May 2020
