Sentiment Analysis Datasets Research Articles

Online lectures are a form of distance learning that relies on information technology. Although the policy has been adopted, it remains controversial: some consider that online lectures introduce a variety of new obstacles, while others consider them the most appropriate way to keep lecture activities running during alarming pandemic conditions. In response to the policy, many people have expressed opinions on the implementation of online lectures on social media, including Twitter. Sentiment analysis is a branch of machine learning that obtains useful information or new knowledge by automatically extracting, understanding, and processing text data. Several methods are widely used to classify sentiment analysis datasets, including K-Nearest Neighbor (K-NN). K-NN is applied here to classify an online lecture dataset because it can produce good accuracy on large amounts of data. Feature selection can also help improve the performance of a machine learning model. The purpose of this study was to determine student sentiment toward online lectures and to measure the accuracy of K-NN combined with various feature selection methods. From 780 tweets, the classification yielded 394 positive, 320 negative, and 39 neutral sentiments, so student opinion was predominantly positive. The accuracy of K-NN alone was 56%; the accuracy of K-NN + Forward Selection was 51%, of K-NN + AdaBoost 54%, and of K-NN + Genetic Algorithm 55%.
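As a rough illustration of the K-NN classification step described in the abstract above, the following minimal sketch classifies short texts by majority vote among their nearest labeled neighbors. It is not the authors' pipeline: the bag-of-words representation, the cosine similarity measure, the value k = 3, and the toy `TRAIN` examples are all assumptions made for this example, and no feature selection is performed.

```python
from collections import Counter
import math


def bag_of_words(text):
    """Represent a text as lowercase token counts (an assumed, very simple featurization)."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def knn_predict(train, text, k=3):
    """Return the majority label among the k training texts most similar to `text`."""
    query = bag_of_words(text)
    ranked = sorted(
        train,
        key=lambda example: cosine_similarity(query, bag_of_words(example[0])),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data, invented for illustration only (not from the study).
TRAIN = [
    ("online lectures are flexible and helpful", "positive"),
    ("i enjoy online lectures at home", "positive"),
    ("online lectures save time and money", "positive"),
    ("online lectures are boring and tiring", "negative"),
    ("bad connection ruins online lectures", "negative"),
    ("i hate tiring online assignments", "negative"),
]

if __name__ == "__main__":
    print(knn_predict(TRAIN, "i enjoy flexible online lectures"))
    print(knn_predict(TRAIN, "boring tiring lectures with bad connection"))
```

A feature selection step such as forward selection would, under the same assumptions, restrict which tokens enter `bag_of_words` before the similarity is computed.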

In regulatory science, reviewing the literature is an essential step that is usually carried out by manually reading hundreds of articles. Although this process is highly time-consuming and labor-intensive, most of its output is not transformed into a machine-readable format. The limited availability of data has largely constrained the development of artificial intelligence (AI) systems that could facilitate literature review in the regulatory process. In the past decade, AI has revolutionized text mining, as many deep learning approaches have been developed to search, annotate, and classify relevant documents. Following these advances in AI algorithms, a lack of high-quality data, rather than the algorithms themselves, has recently become the bottleneck of AI system development. Herein, we constructed two large benchmark datasets, the Chlorine Efficacy dataset (CHE) and the Chlorine Safety dataset (CHS), under a regulatory scenario that sought to assess the antiseptic efficacy and toxicity of chlorine. For each dataset, ∼10,000 scientific articles were initially collected and manually reviewed, and their relevance to the review task was labeled. To ensure high data quality, each paper was labeled by consensus among multiple experienced reviewers. The overall relevance rate was 27.21% (2,663 of 9,788) for CHE and 7.50% (761 of 10,153) for CHS. Furthermore, the relevant articles were categorized into five subgroups based on the focus of their content. Next, we developed an attention-based classification language model using these two datasets. The proposed classification model yielded an Area Under the Curve (AUC) of 0.857 and 0.908 for the CHE and CHS datasets, respectively. This performance was significantly better than that obtained in a permutation test (p < 10E-9), demonstrating that the labeling processes were valid. To conclude, our datasets can be used as benchmarks to develop AI systems, which can further facilitate the literature review process in regulatory science.
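The abstract above reports AUC values and a permutation test but does not describe how either was computed. The sketch below shows one common way to do both, assuming a rank-based AUC and a label-shuffling permutation test; the function names, the number of permutations, and the toy labels and scores are hypothetical and are not taken from the CHE or CHS datasets.

```python
import random


def auc(labels, scores):
    """Rank-based AUC: the chance that a random positive outscores a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )
    return wins / (len(positives) * len(negatives))


def permutation_p_value(labels, scores, n_permutations=2000, seed=0):
    """Estimate how often shuffled labels reach an AUC at least as high as the observed one."""
    rng = random.Random(seed)
    observed = auc(labels, scores)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(shuffled)
        if auc(shuffled, scores) >= observed:
            hits += 1
    # Add-one correction so the estimate is never exactly zero.
    return (hits + 1) / (n_permutations + 1)


# Hypothetical relevance labels (1 = relevant) and model scores, for illustration only.
LABELS = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
SCORES = [0.9, 0.8, 0.7, 0.4, 0.6, 0.3, 0.35, 0.2, 0.1, 0.05]

if __name__ == "__main__":
    print("AUC:", auc(LABELS, SCORES))
    print("permutation p-value:", permutation_p_value(LABELS, SCORES))
```

A small p-value here indicates that the scores separate the labeled classes better than randomly reassigned labels would, which is the sense in which such a test can support the validity of a labeling process.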

Articles published on Sentiment Analysis Datasets

PerceptSent - Exploring Subjectivity in a Novel Dataset for Visual Sentiment Analysis

Resource Construction and Ensemble Learning based Sentiment Analysis for the Low-resource Language Uyghur

Vietnamese Sentiment Analysis: An Overview and Comparative Study of Fine-tuning Pretrained Language Models

A comparison of multiple word embeddings and performance analysis

Implicit sentiment analysis based on affective knowledge and event information

SRL-ACO: A text augmentation framework based on semantic role labeling and ant colony optimization

Implicit Emotion Analysis Based on Improved Supervised Contrastive Learning

Natural Language Processing and Sentiment Analysis on Bangla Social Media Comments on Russia–Ukraine War Using Transformers

Multimodal Emotion Analysis Model based on Interactive Attention Mechanism

Lightweight multilayer interactive attention network for aspect-based sentiment analysis

Alternative Text Pre-Processing using Chat GPT Open AI

TF-TDA: A Novel Supervised Term Weighting Scheme for Sentiment Analysis

RoBERTa-GRU: A Hybrid Deep Learning Model for Enhanced Sentiment Analysis

Enhancing the Generalization for Text Classification through Fusion of Backward Features

SART & COVIDSentiRo: Datasets for Sentiment Analysis Applied to Analyzing COVID-19 Vaccination Perception in Romanian Tweets

An Ontology-based Sentiment Analysis Approach To Discovering Hidden Affected Objects

Sentiment Analysis of Online Lectures using K-Nearest Neighbors based on Feature Selection

Sentiment analysis of Indonesian reviews using fine-tuning IndoBERT and R-CNN

Central Kurdish Sentiment Analysis Using Deep Learning

Development of benchmark datasets for text mining and sentiment analysis to accelerate regulatory literature review
