Abstract

ABSTRACT It is challenging to build domain-specific emotional lexicon for film reviews, due to its unique characteristics, such as massive data, endless new login words, and others. To improve the accuracy of film reviews classification, this article proposes a method for extending emotional lexicon based on word distance and point mutual information. First, using the improved K-means++ algorithm to cluster and select seed words with obvious emotional tendencies. Next, the Distance of Word and Point Mutual Information (DW-PMI) algorithm is presented to determine the emotional polarity of emotional words in the domain of film reviews. Four types of vocabulary, including degree adverb, negation, emoticon and emotion dictionary in the film reviews domain are added to the basic emotion dictionary to extend the film reviews emotional lexicon. From the experimental results, the expanded emotional lexicon of the Chinese film reviews can improve the accuracy and preciseness of the film reviews emotion analysis.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.