Abstract

Owing to the emergence of the Internet and its rapid growth, people can use mobile devices on many social media platforms (blogs, Facebook forums, etc.), and the platforms provide well-known websites for people to express and share their daily activities and ideas on global issues. Many consumers utilize product review websites before making a purchase. Many well-known websites are searched for relevant product reviews and experiences of product use. We can easily collect large amounts of structured and unstructured product data and further analyze the data to determine the desired product information. For this reason, many researchers are gradually focusing on sentiment analysis or opinion exploration (opinion mining) and use this technique to extract and analyze customer opinions and emotions. This paper proposes a sentimental text mining method based on an additional features method to enhance accuracy and reduce implementation time and uses singular value decomposition and principal component analysis for data dimension reduction. This study has four contributions: (1) the proposed algorithm for preprocessing the data for sentiment classification, (2) the additional features to enhance the accuracy of the sentiment classification, (3) the application of singular value decomposition and principal component analysis for data dimension reduction, and (4) the design of five modules based on different features, with or without stemming, to compare the performance results. The experimental results show that the proposed method has better accuracy than other methods and that the proposed method can decrease the implementation time.

Highlights

  • The volume of data from social media and online activities is classified as big data, which allow us to collect a large amount of structured and unstructured data

  • Description the standard deviations of the PC the matrix of feature loadings the means of feature the standard deviations of feature The coordinates of the individuals on the principal components

  • This study proposed an additional feature method to enhance accuracy and the “singular value decomposition (SVD) Principle component analysis (PCA)” method to shorten the implementation time in sentimental text mining

Read more

Summary

Introduction

The volume of data from social media and online activities (e.g., chat rooms, e-commerce, and blogs) is classified as big data, which allow us to collect a large amount of structured and unstructured data. We must extract and analyze the collected data, and this trend refers to big data. Many researchers have proposed automatic text categorization and data analysis methods; these techniques include data mining, web mining, and text mining. The datasets regarding customers’ opinions or reviews are often massive and hard to analyze; it requires additional approaches to summarize them. Many forums, product marketing websites, mobile applications, e-commerce websites, and related web resources have.

Objectives
Methods
Results
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call