Performance Analysis of Embedding Methods for Deep Learning-Based Turkish Sentiment Analysis Models

Abdulfattah Ba Alawi,Ferhat Bozkurt

doi:10.1007/s13369-024-09360-4

Abstract

AbstractThe complex syntactic structure of Turkish text makes sentiment analysis in natural language processing (NLP) a challenging task. Conventional sentiment analysis methods often fail to effectively identify attitudes in Turkish texts, creating an urgent need for more efficient approaches. To fill this need, our study investigates the effectiveness of embedding techniques including pre-trained Turkish models such as Word2Vec, GloVe, and FastText in addition to two character-level embedding methods, namely, character-integer embedding (CIE) and character one-hot encoding embedding (COE), in conjunction with deep learning models specifically long short-term memory (LSTM), convolution neural networks (CNNs), bidirectional LSTM (Bi-LSTM), and hybrid models, for Turkish short-texts sentiment analysis. DL-based models were investigated on two datasets (e.g., an original Twitter (X) dataset and an accessible hotel reviews dataset). In addition to providing an intensive performance analysis of different embedding strategies and assessing their efficacy in dealing with the linguistic intricacies of Turkish, this study proposed a previously unexplored method in Turkish text representation that relies on a character-level one-hot encoding technique. The obtained findings indicate positive progress using a novel approach utilizing a dual-pathway architecture for both character level and word level that constitutes a substantial contribution to the area of natural language processing (NLP), specifically in the context of complex morphological languages. By employing a hybrid strategy that combines character and word levels on Twitter (X) data, the LSTM model obtained anF1 score of$$0.835 \pm 0.005$$0.835±0.005concerning cross-validation while CNN-BiLSTM attained the highestF1 Score (0.8392) using holdout validation. This strategy consistently produced modest improvements across the second public dataset (hotel reviews dataset) by emerging as the runner-up embedding technique in effectiveness, surpassed only by FastText. Findings provide practical recommendations for practitioners on how to effectively use sentiment analysis to make informed decisions by introducing an extensive performance analysis of the use of embedding techniques and deep learning models for sentiment analysis in Turkish texts, which is crucial in the current age of data analysis.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Arabian Journal for Science and Engineering	Publication Date: Aug 1, 2024
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

Performance Analysis of Embedding Methods for Deep Learning-Based Turkish Sentiment Analysis Models

Abstract

Talk to us

Similar Papers

More From: Arabian Journal for Science and Engineering

Lead the way for us

Similar Papers

Gujarati Task Oriented Dialogue Slot Tagging Using Deep Neural Network Models
Rachana Parikh ... Hiren Joshi
-
Rachana Parikh, et. al.Rachana Parikh ... Hiren Joshi
01 Jan 2020
01 Jan 2020

Neural Network-based Approach to Predict Protein Secondary Structure
Arifur Rahman ... Pintu Chandra Shill
-
Arifur Rahman, et. al.Arifur Rahman ... Pintu Chandra Shill
04 May 2023
04 May 2023

Rice Crop Detection Using LSTM, Bi-LSTM, and Machine Learning Models from Sentinel-1 Time Series
Hugo Crisóstomo De Castro Filho ... Osmar Luiz Ferreira De Carvalho
Remote Sensing | VOL. 12
Hugo Crisóstomo De Castro Filho, et. al.Hugo Crisóstomo De Castro Filho ... Osmar Luiz Ferreira De Carvalho
18 Aug 2020
Remote Sensing | VOL. 12

Application of stacked and bidirectional long short-term memory deep learning models for wind speed forecasting at an offshore site
Bharat Kumar Saxena ... Komaragiri Venkata Subba Rao
Energy Sources, Part A: Recovery, Utilization, and Environmental Effects | VOL. ahead-of-print
Bharat Kumar Saxena, et. al.Bharat Kumar Saxena ... Komaragiri Venkata Subba Rao
26 Aug 2021
Energy Sources, Part A: Recovery, Utilization, and Environmental Effects | VOL. ahead-of-print

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Performance Analysis of Embedding Methods for Deep Learning-Based Turkish Sentiment Analysis Models

Abstract

Talk to us

Similar Papers

More From: Arabian Journal for Science and Engineering