Optimal Number Data Trains in Hoax News Detection of Indonesian using SVM and Word2Vec

Muhammad Sulthon Asramanggala,Sri Suryani Prasetyowati,Yuliant Sibaroni

doi:10.47065/bits.v5i1.3516

Muhammad Sulthon Asramanggala, Sri Suryani Prasetyowati + Show 1 more

Open Access

https://doi.org/10.47065/bits.v5i1.3516

Copy DOI

Abstract

Along with the development of the era of technological development also has an increase. Information dissemination occurs very quickly on social media, especially Twitter. On Twitter, only some news circulating is necessarily accurate information. Lots of information that is spread is hoax news that irresponsible individuals apply. In this research, the author will build a system to determine the optimal amount of data trained in the hoax news classification process. In this study, the authors will use the support vector machine and word2vec algorithms to classify hoax and non-hoax news on the system to be created. In this study, five experiments were carried out with the number of train data used as many as 5000, 10000, 15000, 20000, and 25000. 5000 data train results in an accuracy of 77.28%, 10000 data train produce an accuracy of 79.68%, data 15,000 trains produce an accuracy of 79.892%, 20,000 data trains produce an accuracy of 80,416%, and 25,000 data trains produce an accuracy of 81,184%, by using a combination of unigram with token full token selection. This research aims to build a hoax detection system that can determine the optimal amount of data training to use. Also, this research is used to see the performance of the Support Vector Machine algorithm with Word2Vec in detecting hoax news

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Optimal Number Data Trains in Hoax News Detection of Indonesian using SVM and Word2Vec

Abstract

Talk to us

Similar Papers

More From: Building of Informatics, Technology and Science (BITS)

Lead the way for us

Journal: Building of Informatics, Technology and Science (BITS)	Publication Date: Jun 28, 2023
License type: CC BY 4.0

Similar Papers

Comparison of support vector machine and K-Nearest Neighbor algorithms in Indonesia hoax classification
Inte Christinawati Bu'Ulolo ... Wira Epriana Ambarita
-
Inte Christinawati Bu'Ulolo, et. al.Inte Christinawati Bu'Ulolo ... Wira Epriana Ambarita
01 Jan 2021
01 Jan 2021

Ternion: An Autonomous Model for Fake News Detection
Noman Islam ... Asadullah Shaikh
Applied Sciences | VOL. 11
Noman Islam, et. al.Noman Islam ... Asadullah Shaikh
06 Oct 2021
Applied Sciences | VOL. 11

Fake News Detection system using Decision Tree algorithm and compare textual property with Support Vector Machine algorithm
N Leela Siva Rama Krishna ... M Adimoolam
-
N Leela Siva Rama Krishna, et. al.N Leela Siva Rama Krishna ... M Adimoolam
16 Feb 2022
16 Feb 2022

Comparison of NB and SVM in Sentiment Analysis of Cyberbullying using Feature Selection
Selamet Riadi ... Ema Utami
sinkron | VOL. 8
Selamet Riadi, et. al.Selamet Riadi ... Ema Utami
01 Oct 2023
sinkron | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Optimal Number Data Trains in Hoax News Detection of Indonesian using SVM and Word2Vec

Abstract

Talk to us

Similar Papers

More From: Building of Informatics, Technology and Science (BITS)