TG-CSR: A human-labeled dataset grounded in nine formal commonsense categories

Henrique Santos,Alice M Mulvehill,Ke Shen,Mayank Kejriwal,Deborah L Mcguinness

doi:10.1016/j.dib.2023.109666

Henrique Santos, Alice M Mulvehill + Show 3 more

Open Access

https://doi.org/10.1016/j.dib.2023.109666

Copy DOI

Abstract

Machine Common Sense Reasoning is the subfield of Artificial Intelligence that aims to enable machines to behave or make decisions similarly to humans in everyday and ordinary situations. To measure progress, benchmarks in the form of question-answering datasets have been developed and published in the community to evaluate machine commonsense models, including large language models. We describe the individual label data produced by six human annotators originally used in computing ground truth for the Theoretically-Grounded Commonsense Reasoning (TG-CSR) benchmark's composing datasets. According to a set of instructions, annotators were provided with spreadsheets containing the original TG-CSR prompts and asked to insert labels in specific spreadsheet cells during annotation sessions. TG-CSR data is organized in JSON files, individual raw label data in a spreadsheet file, and individual normalized label data in JSONL files. The release of individual labels can enable the analysis of the labeling process itself, including studies of noise and consistency across annotators.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Data in Brief	Publication Date: Oct 11, 2023
Citations: 1	License type: cc-by

R Discovery Prime

R Discovery Prime

TG-CSR: A human-labeled dataset grounded in nine formal commonsense categories

Abstract

Talk to us

Similar Papers

More From: Data in Brief

Lead the way for us

Similar Papers

A Theoretically Grounded Question Answering Data Set for Evaluating Machine Common Sense
Henrique Santos ... Deborah L Mcguinness
Data Intelligence | VOL. -
Henrique Santos, et. al.Henrique Santos ... Deborah L Mcguinness
07 Nov 2023
Data Intelligence | VOL. -

Introduction: Progress in formal commonsense reasoning
Ernest Davis ... Leora Morgenstern
Artificial Intelligence | VOL. 153
Ernest Davis, et. al.Ernest Davis ... Leora Morgenstern
03 Dec 2003
Artificial Intelligence | VOL. 153

FULL-FLEDGED SEMANTIC ANALYSIS AS A TOOL FOR RESOLVING TRIANGLE-COPA SOCIAL SCENARIOS
I M Boguslavsky ... V G Dikonov
-
I M Boguslavsky, et. al.I M Boguslavsky ... V G Dikonov
01 Jan 2020
01 Jan 2020

CORECODE: A Common Sense Annotated Dialogue Dataset with Benchmark Tasks for Chinese Large Language Models
Dan Shi ... Taihao Li
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38
Dan Shi, et. al.Dan Shi ... Taihao Li
24 Mar 2024
Proceedings of the AAAI Conference on Artificial Intelligence | VOL. 38

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

TG-CSR: A human-labeled dataset grounded in nine formal commonsense categories

Abstract

Talk to us

Similar Papers

More From: Data in Brief