Abstract

Human logical thinking is in the form of natural language. With the development of computer science technique, it becomes easier and more convenient for natural language processing. Therefore, a variety number of natural language processing applications have emerged. Sentiment analysis is one of these novel applications, and has been applied in many areas. In Amazon.com, there are a large number of user comments and product discussions, which can help a person to decide whether to buy a product or not without asking the opinions from friends and family members. Therefore, sentiment analysis on user comments and product discussions, such as Amazon review becomes increasingly useful and important. In this paper, the effect of corpus on sentiment analysis of Amazon review dataset with the aid of support vector machine are studied. We generate eight different size datasets from Amazon review dataset filtered by different word frequency in Corpus of Contemporary American, and conduct some experiments on these eight datasets. According to the experimental result, we make some conclusion and give some suggestions to facilitate researchers to make a trade-off between accuracy and experimental cost.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call