A Theoretically Grounded Question Answering Data Set for Evaluating Machine Common Sense

Henrique Santos,Alice M Mulvehill,Ke Shen,Mayank Kejriwal,Deborah L Mcguinness

doi:10.1162/dint_a_00234

Abstract

ABSTRACT Achieving machine common sense has been a longstanding problem within Artificial Intelligence. Thus far, benchmark data sets that are grounded in a theory of common sense and can be used to conduct rigorous, semantic evaluations of common sense reasoning (CSR) systems have been lacking. One expectation of the AI community is that neuro-symbolic reasoners can help bridge this gap towards more dependable systems with common sense. We propose a novel benchmark, called Theoretically Grounded common sense Reasoning (TG-CSR), modeled as a set of question answering instances, with each instance grounded in a semantic category of common sense, such as space, time, and emotions. The benchmark is few-shot i.e., only a few training and validation examples are provided in the public release to avoid the possibility of overfitting. Results from recent evaluations suggest that TG-CSR is challenging even for state-of-the-art statistical models. Due to its semantic rigor, this benchmark can be used to evaluate the common sense reasoning capabilities of neuro-symbolic systems.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Data Intelligence	Publication Date: Nov 7, 2023
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

A Theoretically Grounded Question Answering Data Set for Evaluating Machine Common Sense

Abstract

Talk to us

Similar Papers

More From: Data Intelligence

Lead the way for us

Similar Papers

Introduction: Progress in formal commonsense reasoning
Ernest Davis ... Leora Morgenstern
Artificial Intelligence | VOL. 153
Ernest Davis, et. al.Ernest Davis ... Leora Morgenstern
03 Dec 2003
Artificial Intelligence | VOL. 153

FULL-FLEDGED SEMANTIC ANALYSIS AS A TOOL FOR RESOLVING TRIANGLE-COPA SOCIAL SCENARIOS
I M Boguslavsky ... V G Dikonov
-
I M Boguslavsky, et. al.I M Boguslavsky ... V G Dikonov
01 Jan 2020
01 Jan 2020

Common Sense Reasoning in Diagnostic Systems
Alexander P. ... Vadim N.
-
Alexander P., et. al.Alexander P. ... Vadim N.
09 Sep 2011
09 Sep 2011

Improving Neural Story Generation by Targeted Common Sense Grounding
Huanru Henry Mao ... Garrison Cottrell
-
Huanru Henry Mao, et. al.Huanru Henry Mao ... Garrison Cottrell
01 Jan 2019
01 Jan 2019

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Theoretically Grounded Question Answering Data Set for Evaluating Machine Common Sense

Abstract

Talk to us

Similar Papers

More From: Data Intelligence