Experimental of vectorizer and classifier for scrapped social media data

Setiawan Assegaff,Yovi Pratama,Errissya Rasywir

doi:10.12928/telkomnika.v21i4.24180

Setiawan Assegaff, Yovi Pratama + Show 1 more

Open Access

https://doi.org/10.12928/telkomnika.v21i4.24180

Copy DOI

Abstract

In this study, we used several classifiers and vectorizers to see their effect on processing social media data. In this study, the classifiers used were random forest, logistic regression, Bernoulli Naive Bayes (NB), and support vector clustering (SVC). Random forests are used to reduce spatial complexity, and also to minimize errors. Logistic regression is a method with a statistical model whose basic form uses a logistic function to represent the binary dependent variable. Then, the Naive Bayes function uses binary elements and SVC which has so far given good results rivals other guided learning. Our tests use social media data. Based on the tests that have been carried out on classifier variations and vectorizer variations, it was found that the best classifier is a linear regression algorithm based on predictive adaptive compared to the random forest method based on decision trees, probability-based Bernoulli NB and SVC which work by clustering. Meanwhile, from the test results on the count vectorizer, term frequency-inverse document frequency (TFIDF), and hashing, the best accuracy is achieved on the TFIDF vectorizer. In this case, it means that the TFIDF vectorizer has a better value in presenting word feature dimensions.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Experimental of vectorizer and classifier for scrapped social media data

Abstract

Talk to us

Similar Papers

More From: TELKOMNIKA (Telecommunication Computing Electronics and Control)

Lead the way for us

Journal: TELKOMNIKA (Telecommunication Computing Electronics and Control)	Publication Date: Aug 1, 2023
License type: cc-by-sa

Similar Papers

Applications of Machine Learning Techniques to Predict Diagnostic Breast Cancer
Vikas Chaurasia ... Saurabh Pal
SN Computer Science | VOL. 1
Vikas Chaurasia, et. al.Vikas Chaurasia ... Saurabh Pal
14 Aug 2020
SN Computer Science | VOL. 1

Machine Learning for Predictive Analysis of Otolaryngology Residency Letters of Recommendation.
Vikram Vasan ... Marita S Teng
The Laryngoscope | VOL. 134
Vikram Vasan, et. al.Vikram Vasan ... Marita S Teng
11 Apr 2024
The Laryngoscope | VOL. 134

Comparison of Multinomial Naïve Bayes with K-Nearest Neighbors, Support Vector Machine and Random Forest for Classification of “Network Attacks” Document
Bambang Harjito ... Budi Murtiyas
-
Bambang Harjito, et. al.Bambang Harjito ... Budi Murtiyas
01 Oct 2019
01 Oct 2019

Machine Learning Approach for COVID-19 Detection on Twitter
Samina Amin ... M Irfan Uddin
Computers, Materials & Continua | VOL. 68
Samina Amin, et. al.Samina Amin ... M Irfan Uddin
01 Jan 2020
Computers, Materials & Continua | VOL. 68

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Experimental of vectorizer and classifier for scrapped social media data

Abstract

Talk to us

Similar Papers

More From: TELKOMNIKA (Telecommunication Computing Electronics and Control)