Local Independence Assumption Research Articles

A testlet is a group of multiple-choice items based on a common stimulus. When a testlet is used, traditional item response models may not be appropriate because the assumption of local independence (LI) is violated. A variety of new models have been proposed for analyzing testlet response data. Among them, the Bayesian random-effects model proposed by Bradlow, Wainer and Wang (1999) is one of the most promising. However, in many situations it is not clear to practitioners whether traditional IRT methods should still be used instead of a newly proposed testlet model. The objective of the current study is to investigate the effects of model selection in various situations. In simulation 1, response data sets were generated under three simulation factors: testlet variance (0, 0.5, 1, 2), testlet size (2, 5, 10), and test length (20, 40, 60). For each simulation condition, the test structure was determined by fixing the number of examinees at I = 2000 and the percentage of testlet items in a test at 50%. Under each condition, 30 replications were generated. Both the two-parameter Bayesian testlet random-effects model and the standard two-parameter Bayesian model were fitted to every data set using MCMC. The computer program SCORIGHT was used to conduct all analyses across conditions. The two models were compared on seven criteria, including bias, mean absolute error, root mean square error, correlation between estimated and true values, 95% posterior interval width, and 95% coverage probability. These indices were computed for each parameter separately. Simulation 2 compared the two models under two factors: the proportion of independent items (1/3, 1/2, 2/3) and test length (20, 30, 40, 60). The data generation, analysis process, and criteria mirrored those of simulation 1.
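The data-generation step of such a simulation can be illustrated with a minimal Python sketch. It is not the study's code (the study used SCORIGHT), and several choices here are assumptions: the logit is taken as a_j(θ_i − b_j − γ_{i,d(j)}), the usual form of the testlet random-effects model; abilities and difficulties are drawn from N(0, 1) and discriminations from a lognormal distribution, none of which the abstract specifies; and the function name `simulate_testlet_2pl` is hypothetical.

```python
import numpy as np

def simulate_testlet_2pl(n_persons=2000, n_items=20, testlet_size=5,
                         testlet_share=0.5, testlet_var=1.0, seed=0):
    """Simulate 0/1 responses under a 2PL testlet random-effects model.

    Assumed logit: a_j * (theta_i - b_j - gamma_{i, d(j)}), where d(j) is the
    testlet containing item j and gamma is 0 for independent items.
    """
    rng = np.random.default_rng(seed)

    # Assign the first share of items to testlets; label 0 means "independent".
    n_testlet_items = int(n_items * testlet_share)
    n_testlets = n_testlet_items // testlet_size
    testlet_of_item = np.zeros(n_items, dtype=int)
    for d in range(n_testlets):
        testlet_of_item[d * testlet_size:(d + 1) * testlet_size] = d + 1

    # True parameters (distributions are assumptions, not from the abstract).
    theta = rng.normal(0.0, 1.0, n_persons)   # abilities
    a = rng.lognormal(0.0, 0.3, n_items)      # discriminations
    b = rng.normal(0.0, 1.0, n_items)         # difficulties

    # Person-by-testlet random effects; column 0 stays at zero so that
    # independent items receive no testlet effect.
    gamma = np.zeros((n_persons, n_testlets + 1))
    gamma[:, 1:] = rng.normal(0.0, np.sqrt(testlet_var),
                              (n_persons, n_testlets))

    logit = a * (theta[:, None] - b - gamma[:, testlet_of_item])
    prob = 1.0 / (1.0 + np.exp(-logit))
    responses = (rng.random(prob.shape) < prob).astype(int)
    return responses, theta, a, b, testlet_of_item
```

Setting `testlet_var=0` reduces the generator to a standard 2PL model, which corresponds to the zero-variance condition among the simulation factors.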
The results showed that: (1) The accuracy of all parameter estimates under the 2-PL Bayesian testlet random-effects model remained stable across levels of testlet effect and testlet size. In contrast, the estimation errors of all parameters under the 2-PL Bayesian model increased dramatically as the testlet effect and testlet size became larger. Moreover, under the Bayesian testlet random-effects model, the error for every parameter was always smaller than under the 2-PL Bayesian model. Choosing the 2-PL Bayesian testlet random-effects model was especially necessary when testlet variance and testlet size were large. (2) Although the accuracy of item parameter estimates in the Bayesian testlet random-effects model was not affected by test length, the accuracy of the ability parameter was. Moreover, as the test got shorter, the errors of all parameters under the 2-PL Bayesian model increased dramatically. Overall, under short test conditions, the Bayesian testlet random-effects model did not work well even when there was a large testlet effect; and if the items were all independent, using the Bayesian testlet random-effects model resulted in much worse ability estimates than the 2-PL Bayesian model. (3) When the proportion of independent items was large and the test was longer than 20 items, the estimates from the two models did not differ significantly. In conclusion, the 2-PL Bayesian testlet random-effects model is more general. Using the more complex testlet model when items are all independent leads to almost the same accuracy of parameter estimates as using the 2-PL Bayesian model. It is better to choose the 2-PL Bayesian testlet random-effects model when testlet variance, testlet size, and test length are large. However, when the test is short, even the Bayesian testlet random-effects model cannot provide accurate parameter estimates when local dependence is present.
It is therefore important to make sure the test comprises enough items before applying a testlet model. We also give some suggestions for practitioners. During test construction, it is preferable for items to be independent; if they are not, shorter testlets and a larger proportion of independent items should be included. During test analysis, local dependence should be checked for first. If the evidence shows a dependence structure, an appropriate model should then be chosen to avoid estimation errors.
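The recovery criteria named in the abstract (bias, mean absolute error, root mean square error, correlation with true values, 95% interval width, and coverage) have standard definitions, sketched below in Python. This is an illustrative helper, not the study's code; the function name `recovery_criteria` and its arguments are hypothetical, and the interval bounds are assumed to come from whatever posterior summary the fitting program reports.

```python
import numpy as np

def recovery_criteria(true, est, lower=None, upper=None):
    """Summarize how well estimates recover known true parameter values.

    `lower` and `upper` are optional 95% posterior interval bounds, one pair
    per parameter; when given, mean width and coverage are also reported.
    """
    true = np.asarray(true, dtype=float)
    est = np.asarray(est, dtype=float)
    err = est - true

    out = {
        "bias": err.mean(),                      # signed average error
        "mae": np.abs(err).mean(),               # mean absolute error
        "rmse": np.sqrt((err ** 2).mean()),      # root mean square error
        "corr": np.corrcoef(true, est)[0, 1],    # estimated vs. true
    }
    if lower is not None and upper is not None:
        lower = np.asarray(lower, dtype=float)
        upper = np.asarray(upper, dtype=float)
        out["interval_width"] = (upper - lower).mean()
        # Share of intervals that contain the true value.
        out["coverage"] = ((lower <= true) & (true <= upper)).mean()
    return out
```

In a simulation with replications, these summaries would typically be computed per replication and per parameter type (for example, abilities and item difficulties separately) and then averaged within each condition.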

We review the papers presented at the NCI/DIA conference to identify areas of controversy and uncertainty, and to highlight those aspects of item response theory (IRT) and computer adaptive testing (CAT) that require theoretical or empirical research in order to justify their application to patient-reported outcomes (PROs). IRT and CAT offer exciting potential for the development of a new generation of PRO instruments. However, most of the research into these techniques has been in non-healthcare settings, notably in education. Educational tests are very different from PRO instruments, and consequently problematic issues arise when adapting IRT and CAT to healthcare research. Clinical scales differ appreciably from educational tests, and symptoms have characteristics distinctly different from examination questions. This affects the transfer of IRT technology. Particular areas of concern when applying IRT to PROs include inadequate software, difficulties in selecting models and communicating results, insufficient testing of local independence and other assumptions, and a need for guidelines on estimating sample size requirements. Similar concerns apply to differential item functioning (DIF), which is an important application of IRT. Multidimensional IRT is likely to be advantageous only for closely related PRO dimensions. Although IRT and CAT provide appreciable potential benefits, there is a need for circumspection. Not all PRO scales are necessarily appropriate targets for this methodology. Traditional psychometric methods, and especially qualitative methods, continue to have an important role alongside IRT. Research should be funded to address the specific concerns that have been identified.

Articles published on Local Independence Assumption

The Usefulness of the Rasch Model for the Refinement of Likert Scale Questionnaires

The Performance of Local Dependence Measures With Psychological Data

The Application of Graded Response Model to the Test Data Violated the Assumption of Local Independence

Identifying Local Dependence With a Score Test Statistic Based on the Bifactor Logistic Model

Assessing Fit of Item Response Models Using the Information Matrix Test

When Should We Use Testlet Model? A Comparison Study of Bayesian Testlet Random-Effects Model and Standard 2-PL Bayesian Model

A laboratory study on the reliability estimations of the mini-CEX

Psychometric assessment of HIV/STI sexual risk scale among MSM: A Rasch model approach

Non-compliance in surgical patients with herniated lumbar discs

Hidden Markov partition models

De complexiteit van het concept mondgezondheidgerelateerde levenskwaliteit

Assessing Fit of Unidimensional Graded Response Models Using Bayesian Methods

Classification and generation of disturbance vectors for collision attacks against SHA-1

Local Dependence Model in Latent Rank Theory

A dimensionally reduced finite mixture model for multilevel data

Model specification in oral health‐related quality of life research

Locally Dependent Latent Class Models with Covariates: An Application to Under-Age Drinking in the USA

Applying item response theory and computer adaptive testing: the challenges for health outcomes assessment

Testing for Local Dependence in Rasch’s Multiplicative Gamma Model for Speed Tests

The Presence and Impact of Local Item Dependence on Objective Structured Clinical Examinations Scores and the Potential Use of the Polytomous, Many-Facet Rasch Model
