Abstract

Nowadays, based on mobile devices and internet, social network services (SNS) are common trends to everyone. Social opinions as public opinions are very important to the government, company, and a person. Analysis and decision of social polarity of SNS about social happenings, political issues and government policies, or commercial products is very critical to the government, company, and a person. Newly coined words and emoticons on SNS are created every day. Specifically, emoticons are made and sold by a person or companies. Newly coined words are mostly made and used by various kinds of communities. The SNS big data mainly consist of normal text with newly coined words and emoticons so that newly coined words and emoticons analysis is very important to understand the social and public opinions. Social big data is informally made and unstructured, and on social network services, many kinds of newly coined words and various emoticons are made anonymously and unintentionally by people and companies. In the analysis of social data, newly coined words and emoticons limit the guarantee the accuracy of analysis. The newly coined words implicitly contain the social opinions and trends of people. The emotional states of people significantly are expressed by emoticons. Although the newly coined words and emoticons are an important part of the social opinion analysis, they are excluded from the emotional dictionary and social big data analysis. In this research, newly coined words and emoticons are extracted from the raw Twitter’s twit messages and analyzed and included in a pre-built dictionary with the polarity and weight of the newly coined words and emoticons. The polarity and weight are calculated for emotional classification. The proposed emotional classification algorithm calculates the weight of polarity (positive or negative) and results in total polarity weight of social opinion. If the total polarity weight of social opinion is more than the pre-fixed threshold value, the twit message is decided as positive. If it is less than the pre-fixed threshold value, the twit message is decided as negative and the other values mean neutral opinion. The accuracy of the social big data analysis result is improved by quantifying and analyzing emoticons and newly coined words.

Highlights

  • As social networks services become popular and expresses social opinions more efficiently, social network service participants increase, and this increases the volume of social big data

  • Spoken words, idiomatic expressions, newly coined words, and emoticons are excluded from the emotional analysis process

  • The system is built on JAVA JDK 1.6 and CentOS 6.5

Read more

Summary

Introduction

As social networks services become popular and expresses social opinions more efficiently, social network service participants increase, and this increases the volume of social big data. The Korean government and global Korean company recognize social big data as important data and actively utilize it for customer’s needs and demands. For this reason, research on text mining, natural language processing, and emotional analysis is actively carried out. The newly coined words and emoticon express people’s emotions and opinions appropriately and are directly related to the social emotion.

Related Works
Blog 3 Blog
Collecting Social Data
Preprocessing Process
Morpheme Extraction
Emotional Classification
Discussion
Findings
Conclusions
Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call