Sarcasm detection using news headlines dataset

Rishabh Misra, Prahal Arora

doi:10.1016/j.aiopen.2023.01.001

Open Access

https://doi.org/10.1016/j.aiopen.2023.01.001

Journal: AI Open
Publication Date: Jan 1, 2023
Citations: 27
License type: CC BY-NC-ND

Affiliation: Twitter (United States), Meta (United States)

Abstract

Sarcasm has been an elusive concept for humans. Due to its interesting linguistic properties, sarcasm detection has gained traction in the Natural Language Processing (NLP) research community in the past few years. However, the task of predicting sarcasm in a text remains a difficult one for machines as well, and there are limited insights into what makes a sentence sarcastic. Past studies in sarcasm detection use either large-scale datasets collected using tag-based supervision or small-scale manually annotated datasets. The former category of datasets is noisy in terms of labels and language, whereas the latter category does not have enough instances to train deep learning models reliably despite having high-quality labels. To overcome these shortcomings, we introduce a high-quality and relatively larger-scale dataset, which is a collection of news headlines from a sarcastic news website and a real news website. We describe the unique aspects of our dataset and compare its various characteristics with other benchmark datasets in the sarcasm detection domain. Furthermore, we produce insights into what constitutes sarcasm in a text using a Hybrid Neural Network architecture. The dataset was first released in 2019, and we dedicate a section to how the NLP research community has extensively relied upon our contributions to push the state of the art further in the sarcasm detection domain. Lastly, we make the dataset as well as the framework implementation publicly available to facilitate continued research in this domain.
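
The source-based labeling idea in the abstract (a headline's label follows from the website it was published on, rather than from a hashtag or a manual annotation) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the abstract does not name the two websites, so the domains are placeholders, the headlines are invented, and the field names `headline`, `article_link`, and `is_sarcastic` are hypothetical rather than taken from the paper.

```python
from urllib.parse import urlparse

# Hypothetical source domains: the abstract does not name the two websites.
SARCASTIC_DOMAINS = {"satire.example.com"}
REAL_DOMAINS = {"news.example.com"}


def label_from_source(article_link: str) -> int:
    """Return 1 if the link points to a sarcastic source, 0 if to a real news source."""
    domain = urlparse(article_link).netloc.lower()
    if domain in SARCASTIC_DOMAINS:
        return 1
    if domain in REAL_DOMAINS:
        return 0
    raise ValueError(f"unknown source domain: {domain!r}")


def build_dataset(records):
    """Attach a source-derived label to each (headline, article_link) pair."""
    return [
        {
            "headline": headline,
            "article_link": link,
            "is_sarcastic": label_from_source(link),
        }
        for headline, link in records
    ]


# Invented example headlines, for illustration only.
records = [
    ("Area man wins argument with himself", "https://satire.example.com/a1"),
    ("City council approves new budget", "https://news.example.com/b2"),
]
dataset = build_dataset(records)
```

Because the label depends only on the publishing source, no per-headline annotation is needed, which is one way a dataset of this kind could be both larger than manually annotated collections and less noisy than tag-supervised ones.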
