Perbandingan Metode TF-ABS dan TF-IDF Pada Klasifikasi Teks Helpdesk Menggunakan K-Nearest Neighbor

Riza Adrianti Supono Riza Adrianti Supono,Muhammad Azis Suprayogi Muhammad Azis Suprayogi

doi:10.29207/resti.v5i5.3403

Riza Adrianti Supono Riza Adrianti Supono, Muhammad Azis Suprayogi Muhammad Azis Suprayogi

Open Access

https://doi.org/10.29207/resti.v5i5.3403

Copy DOI

Abstract

Distribution of tickets to the destination unit is a very important function in the helpdesk application, but the process of distributing tickets manually by admin officers has drawbacks, namely ticket distribution errors can occur and increase ticket completion time if the number of tickets is large. Helpdesk text classification becomes important to automatically distribute tickets to the appropriate destination units in a short time. This study was conducted to compare the performance of helpdesk text classification at the Directorate General of State Assets of the Ministry of Finance using the K-Nearest Neighbor (KNN) method with the TF-ABS and TF-IDF weighting methods. The research was conducted by collecting complaint documents, preprocessing, word weighting, feature reduction, classification, and testing. Classification using KNN with parameters n_neighbor (k) namely k=1, k=3, k=5, k=7, k=9, k=11, k=13, k=15, k=17, and k=19 to classify 10,537 helpdesk texts into 8 categories. The test uses a confusion matrix based on the accuracy value and score-f1. The test results show that the TF-ABS weighting method is better than TF-IDF with the highest accuracy value of 90.04% at 15% and k=3.

Highlights

Distribution of tickets to the destination unit is a very important function in the helpdesk application, but the process of distributing tickets manually by admin officers has drawbacks, namely ticket distribution errors can occur and increase ticket completion time if the number of tickets is large
This study was conducted to compare the performance of helpdesk text classification at the Directorate General
The research was conducted by collecting complaint documents

Summary

Uraian insiden Mohon informasi mengenai surat dengan

Berdasarkan beberapa penelitian tersebut, maka penelitian ini dilakukan untuk membandingkan performa klasifikasi teks helpdesk menggunakan metode pembobotan kata TF-ABS dan TF-IDF. Atribut method str.lower() dari library Pandas, selanjutnya yang dibutuhkan untuk klasifikasi teks tiket helpdesk proses tokenization dilakukan menggunakan method adalah uraian isi tiket dan kategori tujuan tiket. Mengingat bahwa fokus pada kategorisasi sebuah term terhadap dokumen, semakin banyak term teks adalah untuk kata yang terdistribusi secara berbeda tersebut muncul pada dokumen maka semakin tinggi pada kategori ck dan ck, tidak penting apakah term nilai term tersebut[12]. Inverse Document Frequency (IDF) yang berfungsi efektif dan efisien dalam menyiapkan data berdimensi untuk mengurangi bobot term yang jumlah tinggi untuk permasalahan data mining dan machine kemunculannya banyak di seluruh dokumen learning dengan tujuan untuk membangun model yang menggunakan Persamaan (1).

Uraian dokumen j

Case Folding

Positif Negatif

Term TF

Actual serta

Findings

Jumlah fitur

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Perbandingan Metode TF-ABS dan TF-IDF Pada Klasifikasi Teks Helpdesk Menggunakan K-Nearest Neighbor

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)

Lead the way for us

Journal: Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)	Publication Date: Oct 24, 2021
License type: CC BY 4.0

Similar Papers

Comparison Of K-nearest Neighbor (KNN) And Linear Discriminant Analysis (LDA) Algorithms For Mature Ajwa Date Fruit Classification
Risna Risna ... Shofwatul Uyun
International Conference on Information Science and Technology Innovation (ICoSTEC) | VOL. 2
Risna Risna, et. al.Risna Risna ... Shofwatul Uyun
05 Mar 2023
International Conference on Information Science and Technology Innovation (ICoSTEC) | VOL. 2

Classification based on K-Nearest Neighbor and Logistic Regression method of coffee using Electronic Nose
D R Prehanto ... I K D Nuryana
IOP Conference Series: Materials Science and Engineering | VOL. 1098
D R Prehanto, et. al.D R Prehanto ... I K D Nuryana
01 Mar 2021
IOP Conference Series: Materials Science and Engineering | VOL. 1098

Classification of arrhythmias using spectral features with K Nearest Neighbor method
Irem Hilavin ... Mehmet Kuntalp
-
Irem Hilavin, et. al.Irem Hilavin ... Mehmet Kuntalp
01 Apr 2011
01 Apr 2011

Optimizing Collaborative Filtering by Interpolating the Individual and Group Behaviors
Xue-Mei Jiang ... Wei-Guo Feng
-
Xue-Mei Jiang, et. al.Xue-Mei Jiang ... Wei-Guo Feng
01 Jan 2006
01 Jan 2006

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Perbandingan Metode TF-ABS dan TF-IDF Pada Klasifikasi Teks Helpdesk Menggunakan K-Nearest Neighbor

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)