Multi-lingual opinion mining on YouTube

Aliaksei Severyn,Alessandro Moschitti,Olga Uryupina,Barbara Plank,Katja Filippova

doi:10.1016/j.ipm.2015.03.002

Abstract

In order to successfully apply opinion mining (OM) to the large amounts of user-generated content produced every day, we need robust models that can handle the noisy input well yet can easily be adapted to a new domain or language. We here focus on opinion mining for YouTube by (i) modeling classifiers that predict the type of a comment and its polarity, while distinguishing whether the polarity is directed towards the product or video; (ii) proposing a robust shallow syntactic structure (STRUCT) that adapts well when tested across domains; and (iii) evaluating the effectiveness on the proposed structure on two languages, English and Italian. We rely on tree kernels to automatically extract and learn features with better generalization power than traditionally used bag-of-word models. Our extensive empirical evaluation shows that (i) STRUCT outperforms the bag-of-words model both within the same domain (up to 2.6% and 3% of absolute improvement for Italian and English, respectively); (ii) it is particularly useful when tested across domains (up to more than 4% absolute improvement for both languages), especially when little training data is available (up to 10% absolute improvement) and (iii) the proposed structure is also effective in a lower-resource language scenario, where only less accurate linguistic processing tools are available.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Information Processing & Management	Publication Date: Apr 9, 2015
Citations: 101	License type: other-oa

R Discovery Prime

R Discovery Prime

Multi-lingual opinion mining on YouTube

Abstract

Talk to us

Similar Papers

More From: Information Processing & Management

Lead the way for us

Similar Papers

Social information discovery enhanced by sentiment analysis techniques
Claudia Diamantini ... Emanuele Storti
Future Generation Computer Systems | VOL. 95
Claudia Diamantini, et. al.Claudia Diamantini ... Emanuele Storti
06 Feb 2018
Future Generation Computer Systems | VOL. 95

Bi-directional emotional contagion: An analysis of chinese parents’ social media data
Wenwei Luo ... Michael J Berson
Computers and Education Open | VOL. 3
Wenwei Luo, et. al.Wenwei Luo ... Michael J Berson
08 Jun 2022
Computers and Education Open | VOL. 3

Visualization Tool for Interpreting User Needs From User-Generated Content via Text Mining and Classification
Thomas Stone ... Seung-Kyum Choi
-
Thomas Stone, et. al.Thomas Stone ... Seung-Kyum Choi
17 Aug 2014
17 Aug 2014

Semantic sentiment analysis of microblogs

-

22 Jun 2015
22 Jun 2015

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Multi-lingual opinion mining on YouTube

Abstract

Talk to us

Similar Papers

More From: Information Processing &amp; Management

More From: Information Processing & Management