Incorporating Background Checks with Sentiment Analysis to Identify Violence Risky Chinese Microblogs

Yun-Fei Jia, Shan Li, Renbiao Wu

doi:10.3390/fi11090200

Abstract

With Web 2.0 technology, more and more people express their attitudes and opinions on the Internet. Radical ideas, rumors, terrorism, and violent content are also propagated online, causing several incidents of social panic every year in China. In fact, most of this content is joking or emotional catharsis, so detecting it with conventional techniques usually incurs a high false alarm rate. To address this problem, this paper introduces a technique that combines sentiment analysis with background checks. State-of-the-art sentiment analysis usually depends on training datasets from a specific topic area. Unfortunately, for some domains, such as detecting speech that carries a risk of violence, there is no definitive training data. In particular, topic-independent sentiment analysis of short Chinese text has rarely been reported in the literature. In this paper, the violence risk of Chinese microblogs is calculated from multiple perspectives. First, a lexicon-based method is used to retrieve violence-related microblogs, and then a similarity-based method is used to extract sentiment words. Semantic rules and emoticons are employed to obtain the sentiment polarity and sentiment strength of short texts. Second, the activity risk is calculated from the characteristics of the part-of-speech (PoS) sequence and from semantic rules, and a threshold is then set to capture the key users. Finally, the risk is confirmed by the historical speech of the key users and the opinions of their circle of friends. The experimental results show that the proposed approach outperforms the support vector machine (SVM) method on a topic-independent corpus and can effectively reduce the false alarm rate.
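The first stage described in the abstract (lexicon-based retrieval, then polarity and strength from semantic rules and emoticons) can be illustrated with a minimal sketch. This is not the authors' implementation: every lexicon entry, weight, and function name below is a hypothetical placeholder, English tokens stand in for segmented Chinese words, and the similarity-based extraction of sentiment words is not covered.

```python
# Toy lexicons. Every entry and weight is an illustrative assumption,
# not taken from the paper.
VIOLENCE_LEXICON = {"bomb", "kill", "attack"}
SENTIMENT_LEXICON = {"hate": -2.0, "angry": -1.5, "happy": 1.5, "love": 2.0}
NEGATIONS = {"not", "never"}
INTENSIFIERS = {"very": 1.5, "really": 1.5}
EMOTICONS = {":)": 1.0, ":(": -1.0}


def is_violence_related(tokens):
    """Retrieval step: keep posts that contain a violence-lexicon word."""
    return any(t in VIOLENCE_LEXICON for t in tokens)


def sentiment_score(tokens):
    """Signed score: the sign is the polarity, the magnitude the strength."""
    score = 0.0
    negate = False   # set by a negation word, applied to the next sentiment word
    weight = 1.0     # accumulated intensifier weight for the next sentiment word
    for t in tokens:
        if t in NEGATIONS:
            negate = not negate
        elif t in INTENSIFIERS:
            weight *= INTENSIFIERS[t]
        elif t in SENTIMENT_LEXICON:
            value = SENTIMENT_LEXICON[t] * weight
            score += -value if negate else value
            negate, weight = False, 1.0  # modifiers are consumed
        elif t in EMOTICONS:
            score += EMOTICONS[t]        # emoticons add directly to the score
    return score
```

With these toy values, `sentiment_score("i really hate them :(".split())` combines an intensified negative word with a negative emoticon, while `sentiment_score("i do not hate you".split())` flips the polarity of "hate" through the negation rule. Real Chinese text would first need word segmentation.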

Highlights

With the rapid development of Web 2.0, more and more people retrieve and share information on social media
More and more violent threats are appearing on the Internet, especially through social media such as Chinese microblogs
Topic-independent sentiment analysis of Chinese short text is rarely reported in the literature

Summary

Introduction

With the rapid development of Web 2.0, more and more people retrieve and share information on social media. A microblog post is limited to 140 characters, which encourages users to publish their opinions more frequently and quickly. Most sentiment analysis work in the literature depends on specific training data and usually performs well only when the training and test data match closely. A background check refers to sentiment analysis of the historical microblogs of the key users and of the relevant opinions published by their Internet friends (their circle of friends). A typical sign is long-term negative sentiment. This can be determined via in-depth exploration of the personal details and historical microblogs of these key users.
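The background check can be sketched in a few lines. This is a rough illustration under stated assumptions, not the paper's method: the function name, both thresholds, and the rule for combining the user's history with the friends' opinions are all hypothetical, and the sentiment scores are assumed to be signed numbers where negative values mean negative sentiment.

```python
from statistics import mean


def background_check(post_risk, history_scores, friend_scores,
                     risk_threshold=0.5, negative_threshold=-0.5):
    """Return True when a risky post is backed by sustained negative context.

    post_risk      -- risk score of the current microblog (assumed scale)
    history_scores -- sentiment scores of the user's historical microblogs
    friend_scores  -- sentiment scores of relevant posts from the friend circle
    Both thresholds are illustrative defaults, not values from the paper.
    """
    if post_risk < risk_threshold:
        return False  # the author is not flagged as a key user

    # Long-term negative sentiment in the user's own history.
    history_negative = (bool(history_scores)
                        and mean(history_scores) <= negative_threshold)
    # Negative sentiment in the friend circle's relevant opinions.
    friends_negative = (bool(friend_scores)
                        and mean(friend_scores) <= negative_threshold)

    # Assumption: either signal is enough to confirm the risk.
    return history_negative or friends_negative
```

A user whose flagged post sits on top of a mostly neutral or positive history is treated as joking or venting, which is how a check of this kind would lower the false alarm rate.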



Journal: Future Internet
Publication Date: Sep 19, 2019
Citations: 4
License type: CC BY 4.0


