A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Jordan M Wheeler,Allan S Cohen,Shiyu Wang

doi:10.3102/10769986231209446

Abstract

Topic models are mathematical and statistical models used to analyze textual data. The objective of topic models is to gain information about the latent semantic space of a set of related textual data. The semantic space of a set of textual data contains the relationship between documents and words and how they are used. Topic models are becoming more common in educational measurement research as a method for analyzing students’ responses to constructed-response items. Two popular topic models are latent semantic analysis (LSA) and latent Dirichlet allocation (LDA). LSA uses linear algebra techniques, whereas LDA uses an assumed statistical model and generative process. In educational measurement, LSA is often used in algorithmic scoring of essays due to its high reliability and agreement with human raters. LDA is often used as a supplemental analysis to gain additional information about students, such as their thinking and reasoning. This article reviews and compares the LSA and LDA topic models. This article also introduces a methodology for comparing the semantic spaces obtained by the two models and uses a simulation study to investigate their similarities.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Abstract

Talk to us

Similar Papers

More From: Journal of Educational and Behavioral Statistics

Lead the way for us

Journal: Journal of Educational and Behavioral Statistics	Publication Date: Nov 27, 2023
Citations: 1

Similar Papers

Proposed information retrieval systems using LDA topic modeling for answer finding of COVID 19 pandemic: A brief survey of approaches and techniques
Suhad Ateyah ... Salam Al-Augby
-
Suhad Ateyah, et. al.Suhad Ateyah ... Salam Al-Augby
01 Jan 2023
01 Jan 2023

An intelligent literature review: adopting inductive approach to define machine learning applications in the clinical domain
Renu Sabharwal ... Shah J Miah
Journal of Big Data | VOL. 9
Renu Sabharwal, et. al.Renu Sabharwal ... Shah J Miah
28 Apr 2022
Journal of Big Data | VOL. 9

TOPIC MODELING IN COVID-19 VACCINATION REFUSAL CASES USING LATENT DIRICHLET ALLOCATION AND LATENT SEMANTIC ANALYSIS
Ulfah Malihatin S ... Uce Indahyanti
Jurnal Teknik Informatika (Jutif) | VOL. 4
Ulfah Malihatin S, et. al.Ulfah Malihatin S ... Uce Indahyanti
03 Oct 2023
Jurnal Teknik Informatika (Jutif) | VOL. 4

Sentiment Analysis of Consumer-Generated Online Reviews of Physical Bookstores Using Hybrid LSTM-CNN and LDA Topic Model
Yan Wang ... Xiaoyu Chang
-
Yan Wang, et. al.Yan Wang ... Xiaoyu Chang
01 Oct 2020
01 Oct 2020

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement

Abstract

Talk to us

Similar Papers

More From: Journal of Educational and Behavioral Statistics