UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark

Nicholas Lourie,Yejin Choi,Ronan Le Bras,Chandra Bhagavatula

doi:10.1609/aaai.v35i15.17590

Abstract

Commonsense AI has long been seen as a near impossible goal---until recently. Now, research interest has sharply increased with an influx of new benchmarks and models. We propose two new ways to evaluate commonsense models, emphasizing their generality on new tasks and building on diverse, recently introduced benchmarks. First, we propose a new multitask benchmark, Rainbow, to promote research on commonsense models that generalize well over multiple tasks and datasets. Second, we propose a novel evaluation, the cost equivalent curve, that sheds new insight on how the choice of source datasets, pretrained language models, and transfer learning methods impacts performance and data efficiency. We perform extensive experiments---over 200 experiments encompassing 4800 models---and report multiple valuable and sometimes surprising findings, e.g., that transfer almost always leads to better or equivalent performance if following a particular recipe, that QA-based commonsense datasets transfer well with each other, while commonsense knowledge graphs do not, and that perhaps counter-intuitively, larger models benefit more from transfer than smaller ones. Last but not least, we introduce a new universal commonsense reasoning model, UNICORN, that establishes new state-of-the-art performance across 8 popular commonsense benchmarks, aNLI (87.3%), CosmosQA (91.8%), HellaSWAG (93.9%), PIQA (90.1%), SocialIQa (83.2%), WinoGrande (86.6%), CycIC (94.0%) and CommonsenseQA (79.3%).

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence

Lead the way for us

Journal: Proceedings of the AAAI Conference on Artificial Intelligence	Publication Date: May 18, 2021
Citations: 16

Similar Papers

Symbolic Knowledge Distillation: from General Language Models to Commonsense Models
...
-
, et. al. ...
27 Jun 2022
27 Jun 2022

Utilizing Language Models to Expand Vision-Based Commonsense Knowledge Graphs
Navid Rezaei ... Marek Z Reformat
Symmetry | VOL. 14
Navid Rezaei, et. al.Navid Rezaei ... Marek Z Reformat
17 Aug 2022
Symmetry | VOL. 14

(Comet-) Atomic 2020: On Symbolic and Neural Commonsense Knowledge Graphs
Jena D Hwang ... Ronan Le Bras
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 35
Jena D Hwang, et. al.Jena D Hwang ... Ronan Le Bras
18 May 2021
(Comet-) Atomic 2020: On Symbolic and Neural Commonsense Knowledge Graphs
Jena D Hwang ... Ronan Le Bras

COMET: Commonsense Transformers for Automatic Knowledge Graph Construction
Antoine Bosselut ... Asli Celikyilmaz
-
Antoine Bosselut, et. al.Antoine Bosselut ... Asli Celikyilmaz
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

UNICORN on RAINBOW: A Universal Commonsense Reasoning Model on a New Multitask Benchmark

Abstract

Talk to us

Similar Papers

More From: Proceedings of the AAAI Conference on Artificial Intelligence