Abstract

Users of social media may use words and phrases literally to convey their views or opinion clearly. However, some people choose to utilise idioms or proverbs that are implicit and indirect in order to make a stronger impression on the audience or perhaps to catch their attention by utilising a funny, sarcastic, or metaphorical phrases. Idioms and proverbs are examples of figurative expressions with a thematically coherent totality that cannot be understood literally. In a previous work, the extension of IBM’s Sentiment Lexicon of Idiomatic Expressions was proposed to include around 9,000 idioms; both lexicons are manually annotated by crowdsourcing service. Therefore, in this research, we provide knowledge-based expansion approach to avoid human annotation of idioms. For sentiment classification, the proposed method has the advantage that it does not require any fine-tuning for the BERT model. Experimental comparisons show that the automated idiom enrichment and annotation are very beneficial for the performance of the sentiment classifier. The expanded annotated lexicon will be made available to the general public.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call