Abstract

There are many cases of email abuse that have the potential to harm others. This email abuse is commonly known as spam, which contains advertisements, phishing scams, and even malware. This study purpose to know the classification of email spam with ham using the KNN method as an effort to reduce the amount of spam. KNN can classify spam or ham in an email by checking it using a different K value approach. The results of the classification evaluation using confusion matrix resulted in the KNN method with a value of K = 1 having the highest accuracy value of 91.4%. From the results of the study, it is known that the optimization of the K value in KNN using frequency distribution clustering can produce high accuracy of 100%, while k-means clustering produces an accuracy of 99%. So based on the results of the existing accuracy values, the frequency distribution clustering and k-means clustering can be used to optimize the K-optimal value of the KNN in the classification of existing spam emails.

Highlights

  • There are many cases of email abuse that have the potential to harm others. This email abuse is commonly known as spam, which contains advertisements, phishing scams, and even malware

  • This study purpose to know the classification of email spam with ham using the K-Nearest Neighbor (KNN) method as an effort to reduce the amount of spam

  • The results of the classification evaluation using confusion matrix resulted in the KNN method with a value of K = 1 having the highest accuracy value of 91.4%

Read more

Summary

Metode Penelitian

Bagian ini merupakan tata urut proses penelitian yang dilakukan antara lain dimulai dari desain sitem, pengumpulan data, preprocessing, kemudian klasifikasi menggunakan KNN, serta evaluasi dan validasi hasil berikut : “Subject: 8434 the weather or climate in any particular environment can changed and affected what people eat”. Yang setiap sub babnya akan dijelaskan di bagian 2.3.

Desain Sistem dataset untuk mempermudah pemrosesan data yang
Tokenisasi
Stopword
Stemming
Optimasi Menggunakan K-Means Clustering
Hasil dan Pembahasan
Findings
Skenario C0
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.