COMPARISON OF MACHINE LEARNING METHODS IN CLASSIFYING POVERTY IN INDONESIA IN 2018

Pardomuan Robinson Sihombing, Ade Marsinta Arsani

doi:10.20884/1.jutif.2021.2.1.52

Open Access

https://doi.org/10.20884/1.jutif.2021.2.1.52

Journal: Jurnal Teknik Informatika (Jutif)
Publication Date: Jan 18, 2021
Citations: 3
License type: CC BY 4.0

Affiliation: Badan Pusat Statistik

Abstract

Poverty remains one of the main problems in economic development, alongside inequality, unemployment, and economic growth. This study models poverty directly with a discrete choice model, namely machine learning classification. The data are imbalanced, with one category much smaller than the other, so a combined ("both") resampling method that applies undersampling and oversampling together is used. Several machine learning methods were applied: Decision Tree, Naïve Bayes, K-Nearest Neighbor (KNN), and Rotation Forest. The results show that combined resampling gives the best results for all four methods. Judged by accuracy, specificity, sensitivity, AUC, and the Kappa coefficient, the best method is KNN. The KNN model has an accuracy of 0.73, a sensitivity of 0.68, a specificity of 0.78, and an AUC of 0.73.
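The indicators named in the abstract (accuracy, sensitivity, specificity, and the Kappa coefficient) can all be derived from a two-by-two confusion matrix. The following is a minimal sketch in Python, assuming that "poor" is treated as the positive class. The function name `classification_metrics` and the counts in the example are hypothetical; they are not taken from the paper, and AUC is omitted because it requires predicted scores rather than counts.

```python
def classification_metrics(tp, fn, fp, tn):
    """Compute common binary-classification indicators from confusion-matrix counts.

    tp: poor households predicted poor        fn: poor households predicted non-poor
    fp: non-poor households predicted poor    tn: non-poor households predicted non-poor
    """
    total = tp + fn + fp + tn
    accuracy = (tp + tn) / total
    sensitivity = tp / (tp + fn)   # share of actual positives that were detected
    specificity = tn / (tn + fp)   # share of actual negatives that were detected

    # Cohen's kappa: observed agreement corrected for agreement expected by chance.
    chance_positive = ((tp + fn) / total) * ((tp + fp) / total)
    chance_negative = ((fp + tn) / total) * ((fn + tn) / total)
    expected = chance_positive + chance_negative
    kappa = (accuracy - expected) / (1 - expected)

    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "kappa": kappa,
    }


# Hypothetical counts for illustration only (not the paper's confusion matrix).
metrics = classification_metrics(tp=68, fn=32, fp=22, tn=78)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

Kappa is useful next to accuracy on imbalanced data because a classifier that always predicts the majority class can score high accuracy while its kappa stays at zero.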

Highlights

Poverty remains one of the main problems in economic development, alongside inequality, unemployment, and economic growth.
This study models poverty directly with a discrete choice model, namely machine learning classification.
The results show that combined resampling (undersampling and oversampling together) gives the best results for all four machine learning methods.

Summary

INTRODUCTION: poverty in 2019 based on Susenas data

Poverty remains one of the main problems in economic development, alongside inequality, unemployment, and economic growth. The Decision Tree method integrates easily into database systems, achieves good accuracy, and can uncover unexpected combinations of data. Kurnia [8] classified poverty using the KNN method with an accuracy of up to 90 percent. King and Zeng [11] state that when a classification method is applied to imbalanced data, the classifier tends to nullify the probability of the minority class because predictions lean toward the majority class, so the resulting classification accuracy is poor. This study examines and applies several machine learning methods, namely DT, NB, KNN, and RF, while taking imbalanced data and large data sets into account. The scheme splits the data deterministically (holdout) and resamples with a combination of undersampling and oversampling at once (both/combined sampling) to model the classification of household poverty status in Indonesia.
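The scheme described above, a holdout split followed by combined under- and oversampling, can be sketched as follows. This is an illustration under stated assumptions, not necessarily the authors' procedure: the helper names `holdout_split` and `resample_both`, the 30 percent test fraction, the fixed seed, and the toy data are all hypothetical. The sketch resamples only the training portion, a common practice that keeps the test set at its original class distribution.

```python
import random
from collections import Counter


def holdout_split(rows, test_fraction=0.3, seed=0):
    """Shuffle once with a fixed seed, then cut into (train, test)."""
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = int(round(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def resample_both(rows, label_of, seed=0):
    """Bring every class to the same size, keeping the total roughly unchanged.

    Classes larger than the target are undersampled without replacement;
    classes smaller than the target keep all members and are topped up by
    drawing additional copies with replacement (oversampling).
    """
    rng = random.Random(seed)
    by_class = {}
    for row in rows:
        by_class.setdefault(label_of(row), []).append(row)

    target = len(rows) // len(by_class)
    balanced = []
    for members in by_class.values():
        if len(members) >= target:
            balanced.extend(rng.sample(members, target))
        else:
            extra = rng.choices(members, k=target - len(members))
            balanced.extend(members + extra)
    rng.shuffle(balanced)
    return balanced


# Toy data: 10 "poor" and 90 "non-poor" households (hypothetical).
rows = [(i, "poor" if i < 10 else "non-poor") for i in range(100)]
train, test = holdout_split(rows, test_fraction=0.3, seed=42)
balanced_train = resample_both(train, label_of=lambda row: row[1], seed=42)

print(Counter(label for _, label in train))
print(Counter(label for _, label in balanced_train))
```

After resampling, both classes contribute the same number of training rows, so a classifier such as KNN no longer sees the minority class as a negligible share of its neighbors.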

Modelling

Decision Tree

Naïve Bayes

K-Nearest Neighbor

Rotation Forest

Evaluation

Stages of Data Analysis

RESULTS AND DISCUSSION

Model Selection

Findings

CONCLUSION
