An Applied Statistics dataset for human vs AI-generated answer classification

Md Shahidul Salim,Sk Imran Hossain

doi:10.1016/j.dib.2024.110240

Abstract

Due to the increasing popularity of Large Language Models (LLMs) like ChatGPT, students from various fields now commonly rely on AI-powered text generation tools to complete their assignments. This poses a challenge for course instructors who struggle to identify the authenticity of submitted work. Several AI detection tools for differentiating human-generated text from AI-generated text exist for domains like medical and coding, and available generic tools do not perform well on domain-specific tasks. Those AI detection tools depend on LLM, and to train the LLM, an instruction dataset is needed that helps the LLM to learn the differences between patterns of human-generated text and AI-generated text. To help with the creation of a tool for Applied Statistics, we have created a dataset containing 4231 question-and-answer combinations. To create the dataset, first, we collected 116 questions covering a wide range of topics from Applied Statistics selected by domain experts. Second, we created a framework to randomly distribute and collect answers to the questions from students. Third, we collected answers to fifty assigned questions from each of the 100 students participating in the work. Fourth, we generated an equal number of AI-generated answers using ChatGPT. The prepared dataset will be useful for creating AI-detector tools for the Applied Statistics domain as well as benchmarking AI-detector tools, and the proposed data preparation framework will be useful for collecting data for other domains.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Data in Brief	Publication Date: Mar 2, 2024
Citations: 2	License type: cc-by

R Discovery Prime

R Discovery Prime

An Applied Statistics dataset for human vs AI-generated answer classification

Abstract

Talk to us

Similar Papers

More From: Data in Brief

Lead the way for us

Similar Papers

How Can IJDS Authors, Reviewers, and Editors Use (and Misuse) Generative AI?
Galit Shmueli ... W Nick Street
INFORMS Journal on Data Science | VOL. 2
Galit Shmueli, et. al.Galit Shmueli ... W Nick Street
01 Apr 2023
INFORMS Journal on Data Science | VOL. 2

Detection of immune-related adverse events among hospitalized patients using large language models.
Virginia H Sun ... Meghan E Sise
Journal of Clinical Oncology | VOL. 42
Virginia H Sun, et. al.Virginia H Sun ... Meghan E Sise
01 Jun 2024
Journal of Clinical Oncology | VOL. 42

Performance of Large Language Models on a Neurology Board–Style Examination
Marc Cicero Schubert ... Varun Venkataramani
JAMA network open | VOL. 6
Marc Cicero Schubert, et. al.Marc Cicero Schubert ... Varun Venkataramani
07 Dec 2023
JAMA network open | VOL. 6

Automating Information Retrieval from Biodiversity Literature Using Large Language Models: A Case Study
Vamsi Krishna Kommineni ... Birgitta Koenig-Ries
Biodiversity Information Science and Standards | VOL. 8
Vamsi Krishna Kommineni, et. al.Vamsi Krishna Kommineni ... Birgitta Koenig-Ries
10 Sep 2024
Biodiversity Information Science and Standards | VOL. 8

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

An Applied Statistics dataset for human vs AI-generated answer classification

Abstract

Talk to us

Similar Papers

More From: Data in Brief