Abstract

Sentic Net is a popular resource for concept-level sentiment analysis. Because Sentic Net was created specifically for opinion mining in English language, however, its localization can be very laborious. In this work, a toolkit for creating non-English versions of Sentic Net in a time- and cost-effective way is proposed. This is achieved by exploiting online facilities such as Web dictionaries and translation engines. The challenging issues are three: firstly, when a Web lexicon is used, one sentiment concept in English can usually be mapped to multiple concepts in the local language. In this work, we develop a concept disambiguation algorithm to discover context within texts in the target language. Secondly, the polarity of some concepts in the local language may be different from the counterpart in English, which is referred to as language-dependent sentiment concepts. An algorithm is developed to detect sentiment conflict using sentiment annotation corpora in the two languages. Lastly, some sentiment concepts are not included in the local language after dictionary consulting and online translation. In this work, we develop a tool to extract these concepts from sentiment dictionary in the local language. Our practice and evaluation in constructing the Chinese version of Sentic Net indicate that the proposed algorithms represent an effective toolkit for localizing Sentic Net.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.