Bangla Text Research Articles

Social networking platforms give users countless opportunities to share information, collaborate, and communicate positively. The same platform can be extended to a fabricated and poisonous atmosphere that gives an impersonal, harmful platform for online misuse and assault. Cyberstalking is when someone uses an internet system to ridicule, torment, insult, criticize, slander, and discredit a victim while never seeing them. With the growth of social networks, Facebook has become the online arena for bullying. Since the effects could result in a widespread contagion, it is vital to have models and mechanisms in place for the automatic identification and removal of internet cyberbullying data. This paper presents a robust hybrid ML model for cyberbullying detection in the Bengali language on social media. The Bengalibullying proposal involves an effective text preprocessing to make the Bengali text data into a useful text format, feature extraction using the TfidfVectorizer (TFID) to get the beneficial information of text data and resampling by Instance Hardness Threshold (IHT) procedure to balance the dataset to avoid overfitting or underfitting problems. In our experiment, we used the publicly available Bangla text dataset (44,001 comments) and got the highest performance ever published works on it. The model achieved the most elevated accuracy rate of 98.57% and 98.82% in binary and multilabel classification to detect cyberbullying on social media in the Bengali language. Our best performance findings are more effective than any previous effort in identifying and categorizing bullying in the Bengali language. As a result, we might use our model to correctly classify Bengali bullying in online bullying detection systems, protecting people from being the targets of social bullying.

Read full abstract

The effortless expansion of Internet access has eventually transformed the dissemination behavior toward E-Mode. Thus, the usage of online or, more specifically, “Digital” texts has expanded abruptly. “Bangla,” the seventh most spoken language globally, has no different nature. Communication in the Bangla language has also been exposed on the Internet, which describes the feelings of individuals in any specific context. These enormously generated data from diverse sources have drawn the interest of the researchers working in the Natural Language Processing domain. Despite its relatively complicated structure, a lesser amount of annotated data, as well as a limited number of frameworks and approaches, exist. This lacking of resources has kept several stones unturned in this diverse, emotion-rich, and widely spoken language. To bridge the lacking and absence of resources, this article aims to provide a generalized deduced working procedure in this domain. To do so, the existing research work in the domain of sentiment analysis using Bangla text has been collected, evaluated, and summarized. Also, in this article, the techniques used in pre-processing, feature extraction, and eventually used algorithms have been identified and discussed. Considering these facts, this research work sketches a tentative blueprint of sentiment analysis using Bangla text. Additionally, this article discusses existing regional language corpora such as Tamil, Urdu, and Hindi, as well as English and methodologies used to extract emotional essence from Bangla language comparing other languages. That will assist in determining the probable chosen path of exploring Bangla in a deeper aspect. Moreover, this work has deduced and presented a generalized framework that will direct aspiring researchers to decide the pathway of choosing data vis-à-vis methodologies based on their interests.

Read full abstract

Bangla Text Research Articles

Related Topics

Articles published on Bangla Text

Accurate Prediction of Bangla Text Article Categorization by Utilizing Novel Bangla Stemmer

An Upgraded Approach for Identifying Partially Reduplicated Forms in Bengali Text

Hate speech detection in the Bengali language: a comprehensive survey

Depression Intensity Identification using Transformer Ensemble Technique for the Resource-constrained Bengali Language

Detecting cyberbullying text using the approaches with machine learning models for the low-resource Bengali language

Dialectics of impairment: historical anxieties in late-colonial Bengali fictional narratives on disability

Analyzing Sentiments in eLearning: A Comparative Study of Bangla and Romanized Bangla Text Using Transformers

Classifying Bengali Newspaper Headlines with Advanced Deep Learning Models: LSTM, Bi-LSTM, and Bi-GRU Approaches

A transformer-based generative adversarial learning to detect sarcasm from Bengali text with correct classification of confusing text

A crowdsource based framework for Bengali scene text data collection and detection

Bangla text normalization for text-to-speech synthesizer using machine learning algorithms

Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models

An improved extrinsic monolingual plagiarism detection approach of the Bengali text

A robust hybrid machine learning model for Bengali cyber bullying detection in social media

Strategies for enhancing the performance of news article classification in Bangla: Handling imbalance and interpretation

Lexeme connexion measure of cohesive lexical ambiguity revealing factor: a robust approach for word sense disambiguation of Bengali text

A Voting classification approach for Sentiment Extraction from Bengali text

A Comprehensive Roadmap on Bangla Text-based Sentiment Analysis

Englishization of the Bangla Print Advertisement: An Ideological Apparatus for the Capitalist Regime

CovTiNet: Covid text identification network using attention-based positional embedding feature fusion.

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Bangla Text Research Articles

Related Topics

Articles published on Bangla Text

Accurate Prediction of Bangla Text Article Categorization by Utilizing Novel Bangla Stemmer

An Upgraded Approach for Identifying Partially Reduplicated Forms in Bengali Text

Hate speech detection in the Bengali language: a comprehensive survey

Depression Intensity Identification using Transformer Ensemble Technique for the Resource-constrained Bengali Language

Detecting cyberbullying text using the approaches with machine learning models for the low-resource Bengali language

Dialectics of impairment: historical anxieties in late-colonial Bengali fictional narratives on disability

Analyzing Sentiments in eLearning: A Comparative Study of Bangla and Romanized Bangla Text Using Transformers

Classifying Bengali Newspaper Headlines with Advanced Deep Learning Models: LSTM, Bi-LSTM, and Bi-GRU Approaches

A transformer-based generative adversarial learning to detect sarcasm from Bengali text with correct classification of confusing text

A crowdsource based framework for Bengali scene text data collection and detection

Bangla text normalization for text-to-speech synthesizer using machine learning algorithms

Sentiment analysis in multilingual context: Comparative analysis of machine learning and hybrid deep learning models

An improved extrinsic monolingual plagiarism detection approach of the Bengali text

A robust hybrid machine learning model for Bengali cyber bullying detection in social media

Strategies for enhancing the performance of news article classification in Bangla: Handling imbalance and interpretation

Lexeme connexion measure of cohesive lexical ambiguity revealing factor: a robust approach for word sense disambiguation of Bengali text

A Voting classification approach for Sentiment Extraction from Bengali text

A Comprehensive Roadmap on Bangla Text-based Sentiment Analysis

Englishization of the Bangla Print Advertisement: An Ideological Apparatus for the Capitalist Regime

CovTiNet: Covid text identification network using attention-based positional embedding feature fusion.