Evaluating the Coverage and Depth of Latent Dirichlet Allocation Topic Model in Comparison with Human Coding of Qualitative Data: The Case of Education Research

Gaurav Nanda, Hugo Castellanos, Alex Choi, Aparajita Jaiswal, Alejandra J Magana, Yuzhe Zhou

doi:10.3390/make5020029

Abstract

Fields in the social sciences, such as education research, have started to expand the use of computer-based research methods to supplement traditional research approaches. Natural language processing techniques, such as topic modeling, may support qualitative data analysis by providing early categories that researchers may interpret and refine. This study contributes to this body of research and answers the following research questions: (RQ1) What is the relative coverage of the latent Dirichlet allocation (LDA) topic model and human coding in terms of the breadth of the topics/themes extracted from the text collection? (RQ2) What is the relative depth or level of detail among identified topics using LDA topic models and human coding approaches? A dataset of student reflections was qualitatively analyzed using LDA topic modeling and human coding approaches, and the results were compared. The findings suggest that topic models can provide reliable coverage and depth of themes present in a textual collection comparable to human coding but require manual interpretation of topics. The breadth and depth of human coding output depend heavily on the expertise of coders and the size of the collection; these factors are better handled in the topic modeling approach.
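
As a rough illustration of what "relative coverage" in RQ1 could mean in practice, the sketch below compares two sets of theme labels by simple set overlap. This is an assumption made for illustration only: the abstract does not state how coverage was measured, and the function name `coverage` and all theme labels here are hypothetical, not taken from the study. It also presumes the LDA topics have already been manually interpreted and given labels, which the abstract notes is a required step.

```python
def coverage(reference_themes, candidate_themes):
    """Fraction of reference themes that also appear among candidate themes.

    Hypothetical metric: labels are matched by exact text after
    lowercasing and trimming whitespace.
    """
    reference = {t.strip().lower() for t in reference_themes}
    candidate = {t.strip().lower() for t in candidate_themes}
    if not reference:
        return 0.0
    return len(reference & candidate) / len(reference)


# Hypothetical labels, invented for this example (not data from the paper).
human_themes = ["Teamwork", "Time management", "Debugging", "Communication"]
lda_topic_labels = ["teamwork", "debugging", "time management", "tool usage"]

# Share of human-coded themes that the labeled LDA topics also capture.
lda_vs_human = coverage(human_themes, lda_topic_labels)

# Share of labeled LDA topics that the human coders also identified.
human_vs_lda = coverage(lda_topic_labels, human_themes)

# Themes surfaced only by the topic model.
only_lda = sorted(
    {t.lower() for t in lda_topic_labels} - {t.lower() for t in human_themes}
)

print(lda_vs_human, human_vs_lda, only_lda)
```

Exact label matching is the simplest possible choice; in a real comparison, deciding whether a topic and a human code refer to the same theme would itself require researcher judgment.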

Journal: Machine Learning and Knowledge Extraction
Publication Date: May 14, 2023
Citations: 5
License type: CC BY 4.0

Similar Papers

A topic model approach to identify and track emerging risks from beeswax adulteration in the media
Agnes Rortais ... Lidija Svečnjak
Food Control | VOL. 119
02 Jul 2020

Asynchronous Digital Participation in Urban Design Processes: Qualitative Data Exploration and Analysis With Natural Language Processing
Cem Ataman ... Simon Perrault
01 Jan 2021

An intelligent literature review: adopting inductive approach to define machine learning applications in the clinical domain
Renu Sabharwal ... Shah J Miah
Journal of Big Data | VOL. 9
28 Apr 2022

Trends in COVID-19 Publications: Streamlining Research Using NLP and LDA
Akash Gupta ... Anjali Agrawal
SSRN Electronic Journal
01 Jan 2020
