Abstract
Teacher educators use digital clinical simulations (DCS) to provide improvisation opportunities within low-stakes classroom environments. In this study, we experimented with GPT-3 and few-shot learning to examine whether it could be used to classify open-text DCS responses. We found that GPT-3 performed substantially worse than traditional machine learning (ML) models, even when trained on the same number of examples. However, with a training set of 20 examples, the performance of GPT-3 decreased only marginally relative to the traditional ML models (−0.06). Traditional ML models generally performed well and in some cases approached the human baseline. Future research will examine whether changes to labeling procedures or fine-tuning with existing data can improve the performance of GPT-3 with DCSs.

Keywords: Natural language processing · Few-shot learning · GPT-3 · Simulations · Teacher education · Professional learning
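For readers unfamiliar with the few-shot setup the abstract refers to, the sketch below illustrates how a GPT-3-era model could be prompted to label open-text responses with a handful of in-context examples. This is a minimal illustration using the legacy OpenAI Completions API, not the authors' actual pipeline; the label names and example responses are hypothetical.

```python
# Minimal sketch of few-shot text classification with a GPT-3-era model,
# using the legacy OpenAI Completions API (openai-python < 1.0).
# Labels and examples here are hypothetical, not from the study.
import openai

openai.api_key = "YOUR_API_KEY"  # placeholder

# A handful of labeled examples placed directly in the prompt.
FEW_SHOT_EXAMPLES = [
    ("I would pause and ask the student to explain their thinking.", "probe"),
    ("That answer is wrong; let's move on.", "dismiss"),
]

def classify_response(text: str) -> str:
    """Label an open-text simulation response via few-shot prompting."""
    prompt = "Classify each teacher response.\n\n"
    for example, label in FEW_SHOT_EXAMPLES:
        prompt += f"Response: {example}\nLabel: {label}\n\n"
    prompt += f"Response: {text}\nLabel:"
    completion = openai.Completion.create(
        model="text-davinci-002",  # a GPT-3-era completion model
        prompt=prompt,
        max_tokens=5,
        temperature=0,  # deterministic output for labeling
    )
    return completion.choices[0].text.strip()

print(classify_response("Let's hear from someone else first."))
```

In this setup, enlarging the training set means adding more labeled examples to the prompt rather than retraining a model, which is why few-shot performance in such low-data regimes is a natural comparison point against traditional ML classifiers.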