Abstract
Word embedding vectorization produces more compact word vectors than Bag-of-Words. Word embedding also overcomes the loss of information about sentence context, word order, and semantic relationships between words in sentences. Several kinds of word embedding are often considered for sentiment analysis, such as Word2Vec and FastText. FastText works on character n-grams, while Word2Vec is based on whole words. This research compares the accuracy of sentiment analysis models using Word2Vec and FastText. Both models are tested on sentiment analysis of Indonesian hotel reviews using a dataset from TripAdvisor. Word2Vec and FastText both use the Skip-gram model with the same parameters: number of features, minimum word count, number of parallel threads, and context window size. These vectorizers are combined with ensemble learning: Random Forest, Extra Trees, and AdaBoost, with a Decision Tree used as the baseline for measuring the performance of both models. The results show that both FastText and Word2Vec improve accuracy with Random Forest and Extra Trees. FastText reached higher accuracy than Word2Vec when using Extra Trees and Random Forest as classifiers. FastText raised accuracy by 8 percentage points over the Decision Tree baseline (85%), reaching 93% with 100 estimators.
Published in: Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)