Abstract

AbstractOver the past decade, topic models have been used to analyze students’ responses to constructed-response items. Analyzing students’ responses using topic models has been shown to yield similar results to a qualitative analysis. As the use of topic models increases in the educational setting, it is important to assess the performance of the underlying statistical mechanism. Simulation studies are an essential tool when evaluating the performance of a statistical model. Using a simulation study to assess performance of topic models, such as the latent Dirichlet allocation (LDA) model, requires generating simulated text responses rather than scored responses. LDA and other related topic models, such as the supervised latent Dirichlet allocation model, assumes a generative process for construction of responses. Topic models also assume that the text data follows a bag-of-words distribution. These key assumptions allow generating simulated text responses to be possible. In this paper we demonstrate the simulation process for topic models followed by a simulation study that assesses the sample size needed to recover the parameters of the LDA model.KeywordsEducational topic modelsBayesian estimationSimulation study

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.