Abstract

Aspect Based Sentiment Analysis techniques have been applied in several application domains. From the last two decades, these techniques have been developed mostly for product and service application domains. However, very few aspect-based sentiment techniques have been proposed for the movie application domain. Moreover, these techniques only mine specific aspects (Script, Director, and Actor) of a movie application domain, nevertheless, the movie application domain is more complex than the product and service application domain. Since, it contains NER (Named Entity Recognition) problem and it cannot be ignored, since there is an opinion often associated with it. Consequently, in this paper MAIM (Movie Aspect Identification Model) is proposed that can extract not only movie specific aspects, also identifies NEs (Named Entities) such as Person Name and Movie Title. The three main contributions are 1) the identification of infrequent aspects, 2) the identification of NE (named entity) in movie application domain, 3) identifying N-gram opinion words as an entity. MAIM incorporates the BiLSTM-CRF hybrid technique and is implemented on the movie application domain having precision 89.9%, recall 88.9% and f1-measure 89.4%. The experimental results show that MAIM performs better than baseline models CRF and LSTM-CRF.

Highlights

  • The most intelligent beings in this planet earth are humans since, naturally, they possess cognitive or decision making ability [22]

  • Etaiwi et al [23] has proposed a model for the identification of Arabic Names and states that the CRF is very efficient in identifying name entity recognition (NER) problem, since the movie application domain is based on sequential data and contains NER problem

  • For the MOVIE named entity, Movie Aspects Identification Model (MAIM) (BiLSTM) performs better in F1-measure and reported 6% and 2% more F1-measure than CRF and LSTM-CRF

Read more

Summary

Introduction

The most intelligent beings in this planet earth are humans since, naturally, they possess cognitive or decision making ability [22]. Information scientists have proposed sentiment analysis techniques that can extract sentiments from a large number of reviews. These sentiment analysis techniques can be categorized into three levels, these levels being Document-level, sentence-level and aspect level. If a review says, “The battery life of iPhone is very nice”, the sentence level sentiment analysis would suggest that the whole sentence is subjective, since it contains opinion It would not provide any detail of the iPhones aspects. Several ABSTs (Aspect Based Sentiment Analysis Techniques) have been proposed for different application domains. The movie application domain is more complex than product and service application domain and it contains NER (Named Entity Recognition) problem, indirect speech, and noisy content. The MAIM is using BiLSTM-CRF (Bidirectional Long Short Term Memory – Conditional Random Field) hybrid NERC (Named Entity Recognition and Classification) technique, for the identification of Person Name and Movie Titles in movie application domain

Aspect Identification Techniques for Movie Application Domain
Named Entity Recognition Techniques
Aspect Identification Techniques for Different Application Domains
Critical Analysis and Limitations of ABSTs
Linguistic Features
Orthographic Features
IMDb Database
Semantic Similarity Matrix
MAIM Training and Aspect
Experimental Results
Result Comparison
Conclusion
Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call