Abstract

Social media platforms are extensively used in exchanging and sharing information and user experience, thereby resulting in massive outspread and viewing of personal experiences in many fields of life. Thus, informative health-related videos on YouTube are highly perceptible. Many users tend to procure medical treatments and health-related information from social media particularly from YouTube when searching for chronic illness treatments. Sometimes, these sources contain misinformation that cause fatal effects on the users’ health. Many sentimental analyses and classifications have been conducted on social media platforms to study user post and comments on many life science fields. However, no study has been conducted on the analysis of Arabic user comments, which provide details on herbal treatments for people with diabetes. Therefore, this study proposes a model to detect and discover emotions/opinions of YouTube users on herbal treatment videos is proposed through an analysis of user comments by using machine learning classifiers. In addition, a new Arabic Dataset on Herbal Treatments for Diabetes (ADHTD), which is based on user comments from several YouTube videos, is introduced. This study examines the impact of four representation methods on ADHTD to show the performance of machine learning classifiers. These methods remove repeating characters in Arabic dialect and character extension known as ‘TATAWEEL’ or ‘MAD’, stemming of Arabic words, Arabic stop words removal and N-grams with Arabic words. Experiments has been conducted based aforementioned methods to handle imbalanced proposed dataset and identify the best machine learning classifiers over Arabic dialect textual data. The model has achieved a higher accuracy that reached 95% when using Synthetic Minority Oversampling TEchnique (SMTOE) techniques to balanced dataset than imbalanced dataset.

Highlights

  • Social media platforms occupy a large part of our daily life activities

  • In using SMOTE technique, the performance of the model has been significant improved with Machine Learning Classifiers (MLCs) except with k-Nearest Neighbours (KNN) were noticed the decrease of accuracy

  • Diabetes herbal treatment via YouTube has been detected through this study by analysis user opinions based on user comments

Read more

Summary

Introduction

Social media platforms occupy a large part of our daily life activities. They allow users to exchange information in seconds. YouTube is the most popular video platform, generating billions of views through the uploaded video content. More than two billion logged-in users visit YouTube every month, allowing people to share their perspectives on daily activities, thoughts, experiences, advertisements and educational resources (Bhuiyan et al, 2017; Burns et al, 2020). The advancing technology has made YouTube accessible to users. It has become popular among kids, adults and the elderly crowd as a mode of education and entertainment. Users face difficulties in differentiating the desirable and undesirable content due to the massive unstructured data on YouTube (Awal et al, 2018; Chen et al, 2017)

Objectives
Methods
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call