Text Datasets Research Articles

The paper proposes the basic principles of developing an object-oriented information system for detecting abusive content in Ukrainian-language audio content based on a new algorithm that uses statistical and neural network approaches to detect abusive content. Detecting offensive content in text and audio content is an urgent task, as it helps to create a safe and healthy environment for communication, especially in online platforms. Offensive content can harm the people who hear or read it and violate their rights. It can also have a negative impact on society, contributing to the spread of hatred and violence. To detect abusive speech in audio content, the proposed approach uses two key components: the use of dictionary methods and the analysis of the emotional tonality of utterances. A set of reviews was used as a dataset to determine the abusive component of the content, which was expanded by the authors by adding words of abuse. An object-oriented information system architecture written in the Python programming language in the PyCharm programming environment is proposed. The information system consists of a software module for training recurrent neural network models and further saving trained instances, and a software module for detecting abusive content in Ukrainian-language audio content using trained RNN models. Since the recurrent neural network is trained on a short text data set, the system is less efficient at identifying texts that have a larger number of words. In the example of the proposed approach, the accuracy of detecting offensive content is more than 90%. This means that the algorithm works correctly in the absence of offending highlights in the test data set. The results of the analysis of the effectiveness of the proposed approach show that in the vast majority of cases the conclusions regarding the acceptability of audio content based on the level of abuse are correct.

Read full abstract

Recent breakthroughs in natural language processing and machine learning, exemplified by ChatGPT, have spurred a paradigm shift in healthcare. Released by OpenAI in November 2022, ChatGPT rapidly gained global attention. Trained on massive text datasets, this large language model holds immense potential to revolutionize healthcare. However, existing literature often overlooks the need for rigorous validation and real-world applicability. This head-to-head comparative study assesses ChatGPT's capabilities in providing therapeutic recommendations for head and neck cancers. Simulating every NCCN Guidelines scenarios. ChatGPT is queried on primary treatments, adjuvant treatment, and follow-up, with responses compared to the NCCN Guidelines. Performance metrics, including sensitivity, specificity, and F1 score, are employed for assessment. The study includes 68 hypothetical cases and 204 clinical scenarios. ChatGPT exhibits promising capabilities in addressing NCCN-related queries, achieving high sensitivity and overall accuracy across primary treatment, adjuvant treatment, and follow-up. The study's metrics showcase robustness in providing relevant suggestions. However, a few inaccuracies are noted, especially in primary treatment scenarios. Our study highlights the proficiency of ChatGPT in providing treatment suggestions. The model's alignment with the NCCN Guidelines sets the stage for a nuanced exploration of AI's evolving role in oncological decision support. However, challenges related to the interpretability of AI in clinical decision-making and the importance of clinicians understanding the underlying principles of AI models remain unexplored. As AI continues to advance, collaborative efforts between models and medical experts are deemed essential for unlocking new frontiers in personalized cancer care.

Read full abstract

Text Datasets Research Articles

Articles published on Text Datasets

CoLAL: Co-learning Active Learning for Text Classification

Research on blockchain abnormal transaction detection technology combining CNN and transformer structure

TSIC-CLIP: Traffic Scene Image Captioning Model Based on Clip

A novel approach to texture recognition combining deep learning orthogonal convolution with regional input features.

Short text classification using semantically enriched topic model

Neural collapse inspired semi-supervised learning with fixed classifier

Enhancing Document Clustering with Hybrid Recurrent Neural Networks and Autoencoders: A Robust Approach for Effective Semantic Organization of Large Textual Datasets

The Language of Responsibility in the United Nations Security Council, 1946–2020

Proposed Machine learning model for predicting Egyptian Parliament Election Results

Machine Learning Based Stroke Predictor Application

The performance of large language models in intercollegiate Membership of the Royal College of Surgeons examination.

Automatic analysis of public health service text based on character level convolutional neural network

Human-like cognition: visual features grouping for hard-to-group text dataset

Text Attribute Control via Closed-Loop Disentanglement

Improving Medical Speech-to-Text Accuracy using Vision-Language Pre-training Models.

Taking the Road Less Travelled: How Corpus‐Assisted Discourse Studies Can Enrich Qualitative Explorations of Large Textual Datasets

No time to lie: Examining the identity of pro-vaccination and anti-vaccination supporters through user-generated content

ALGORITHM FOR DETECTION OF ABUSIVE CONTENT IN AUDIO CONTENT FOR IMPLEMENTATION IN OBJECT-ORIENTED INFORMATION SYSTEM

Exploring the landscape of AI-assisted decision-making in head and neck cancer treatment: a comparative analysis of NCCN guidelines and ChatGPT responses.

Cross‐modal knowledge learning with scene text for fine‐grained image classification

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Text Datasets Research Articles

Articles published on Text Datasets

CoLAL: Co-learning Active Learning for Text Classification

Research on blockchain abnormal transaction detection technology combining CNN and transformer structure

TSIC-CLIP: Traffic Scene Image Captioning Model Based on Clip

A novel approach to texture recognition combining deep learning orthogonal convolution with regional input features.

Short text classification using semantically enriched topic model

Neural collapse inspired semi-supervised learning with fixed classifier

Enhancing Document Clustering with Hybrid Recurrent Neural Networks and Autoencoders: A Robust Approach for Effective Semantic Organization of Large Textual Datasets

The Language of Responsibility in the United Nations Security Council, 1946–2020

Proposed Machine learning model for predicting Egyptian Parliament Election Results

Machine Learning Based Stroke Predictor Application

The performance of large language models in intercollegiate Membership of the Royal College of Surgeons examination.

Automatic analysis of public health service text based on character level convolutional neural network

Human-like cognition: visual features grouping for hard-to-group text dataset

Text Attribute Control via Closed-Loop Disentanglement

Improving Medical Speech-to-Text Accuracy using Vision-Language Pre-training Models.

Taking the Road Less Travelled: How Corpus‐Assisted Discourse Studies Can Enrich Qualitative Explorations of Large Textual Datasets

No time to lie: Examining the identity of pro-vaccination and anti-vaccination supporters through user-generated content

ALGORITHM FOR DETECTION OF ABUSIVE CONTENT IN AUDIO CONTENT FOR IMPLEMENTATION IN OBJECT-ORIENTED INFORMATION SYSTEM

Exploring the landscape of AI-assisted decision-making in head and neck cancer treatment: a comparative analysis of NCCN guidelines and ChatGPT responses.

Cross‐modal knowledge learning with scene text for fine‐grained image classification