IRT Models Research Articles

One important problem in the measurement of non-cognitive characteristics such as personality traits and attitudes is that it has traditionally been made through Likert scales, which are susceptible to response biases such as social desirability (SDR) and acquiescent (ACQ) responding. Given the variability of these response styles in the population, ignoring their possible effects on the scores may compromise the fairness and the validity of the assessments. Also, response-style-induced errors of measurement can affect the reliability estimates and overestimate convergent validity by correlating higher with other Likert-scale-based measures. Conversely, it can attenuate the predictive power over non-Likert-based indicators, given that the scores contain more errors. This study compares the validity of the Big Five personality scores obtained: (1) ignoring the SDR and ACQ in graded-scale items (GSQ), (2) accounting for SDR and ACQ with a compensatory IRT model, and (3) using forced-choice blocks with a multi-unidimensional pairwise preference model (MUPP) variant for dominance items. The overall results suggest that ignoring SDR and ACQ offered the worst validity evidence, with a higher correlation between personality and SDR scores. The two remaining strategies have their own advantages and disadvantages. The results from the empirical reliability and the convergent validity analysis indicate that when modeling social desirability with graded-scale items, the SDR factor apparently captures part of the variance of the Agreeableness factor. On the other hand, the correlation between the corrected GSQ-based Openness to Experience scores, and the University Access Examination grades was higher than the one with the uncorrected GSQ-based scores, and considerably higher than that using the estimates from the forced-choice data. Conversely, the criterion-related validity of the Forced Choice Questionnaire (FCQ) scores was similar to the results found in meta-analytic studies, correlating higher with Conscientiousness. Nonetheless, the FCQ-scores had considerably lower reliabilities and would demand administering more blocks. Finally, the results are discussed, and some notes are provided for the treatment of SDR and ACQ in future studies.

Read full abstract

Women's empowerment is a process that includes increases in intrinsic agency (power within); instrumental agency (power to); and collective agency (power with). We used baseline data from two studies-Targeting and Realigning Agriculture for Improved Nutrition (TRAIN) in Bangladesh and Building Resilience in Burkina Faso (BRB)-to assess the measurement properties of survey questions operationalizing selected dimensions of intrinsic, instrumental, and collective agency in the project-level Women's Empowerment in Agricultural Index (pro-WEAI). We applied unidimensional item-response models to question (item) sets to assess their measurement properties, and when possible, their cross-context measurement equivalence-a requirement of measures designed for cross-group comparisons. For intrinsic agency in the right to bodily integrity, measured with five attitudinal questions about intimate partner violence (IPV) against women, model assumptions of unidimensionality and local independence were met. Four items showed good model fit and measurement equivalence across TRAIN and BRB. For item sets designed to capture autonomy in income, intrinsic agency in livelihoods activities, and instrumental agency in: livelihoods activities, the sale or use of outputs, the use of income, and borrowing from financial services, model assumptions were not met, model fit was poor, and items generally were weakly related to the latent (unobserved) agency construct. For intrinsic and instrumental agency in livelihoods activities and for instrumental agency in the sale or use of outputs and in the use of income, items sets had similar precision along the latent-agency continuum, suggesting that similar item sets could be dropped without a loss of precision. IRT models for collective agency were not estimable because of low reported presence and membership in community groups. This analysis demonstrates the use of IRT methods to assess the measurement properties of item sets in pro-WEAI, and empowerment scales generally. Findings suggest that a shorter version of pro-WEAI can be developed that will improve its measurement properties. We recommend revisions to the pro-WEAI questionnaire and call for new measures of women's collective agency.

Read full abstract

IRT Models Research Articles

Related Topics

Articles published on IRT Models

Residual-Based Person Fit Statistics over Test Sections

Controlling for Response Biases in Self-Report Scales: Forced-Choice vs. Psychometric Modeling of Likert Items.

Mixture Rasch Model with Main and Interaction Effects of Covariates on Latent Class Membership

Applications of Mixture IRT Models: A Literature Review

Are We Underestimating Food Insecurity? Partial Identification with a Bayesian 4-Parameter IRT Model

Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning

비대칭문항반응모형의 문항복잡성 타당화 가능성 연구: PISA 2012 프로세스 데이터의 응답 시간․횟수 변인을 중심으로

Comparative Analysis of Classical Test Theory and Item Response Theory using Chemistry Test Data

Comparative Analysis of Classical Test Theory and Item Response Theory using Chemistry Test Data

Contextual Responses to Affirmative and/or Reversed-Worded Items

A Bayesian Random Block Item Response Theory Model for Forced-Choice Formats.

Measurement properties of the project-level Women's Empowerment in Agriculture Index.

Illustration of Multilevel Explanatory IRT Model DIF Testing with the Creative Thinking Scale

Combining mixture distribution and multidimensional IRTree models for the measurement of extreme response styles.

Evaluation on types of invariance in studying extreme response bias with an IRTree approach.

A Comparison of IRT Model Combinations for Assessing Fit in a Mixed Format Elementary School Science Test

Multivariate Higher-Order IRT Model and MCMC Algorithm for Linking Individual Participant Data From Multiple Studies.

Correction for Item Response Theory Latent Trait Measurement Error in Linear Mixed Effects Models

A New Perspective on the Multidimensionality of Divergent Thinking Tasks.

ITEM DIMENSIONALITY EXPLORATION BY MEANS OF CONSTRUCT MAP AND CATEGORICAL PRINCIPAL COMPONENTS ANALYSIS

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

IRT Models Research Articles

Related Topics

Articles published on IRT Models

Residual-Based Person Fit Statistics over Test Sections

Controlling for Response Biases in Self-Report Scales: Forced-Choice vs. Psychometric Modeling of Likert Items.

Mixture Rasch Model with Main and Interaction Effects of Covariates on Latent Class Membership

Applications of Mixture IRT Models: A Literature Review

Are We Underestimating Food Insecurity? Partial Identification with a Bayesian 4-Parameter IRT Model

Prediction of LC-MS/MS Properties of Peptides from Sequence by Deep Learning

비대칭문항반응모형의 문항복잡성 타당화 가능성 연구: PISA 2012 프로세스 데이터의 응답 시간․횟수 변인을 중심으로

Comparative Analysis of Classical Test Theory and Item Response Theory using Chemistry Test Data

Comparative Analysis of Classical Test Theory and Item Response Theory using Chemistry Test Data

Contextual Responses to Affirmative and/or Reversed-Worded Items

A Bayesian Random Block Item Response Theory Model for Forced-Choice Formats.

Measurement properties of the project-level Women's Empowerment in Agriculture Index.

Illustration of Multilevel Explanatory IRT Model DIF Testing with the Creative Thinking Scale

Combining mixture distribution and multidimensional IRTree models for the measurement of extreme response styles.

Evaluation on types of invariance in studying extreme response bias with an IRTree approach.

A Comparison of IRT Model Combinations for Assessing Fit in a Mixed Format Elementary School Science Test

Multivariate Higher-Order IRT Model and MCMC Algorithm for Linking Individual Participant Data From Multiple Studies.

Correction for Item Response Theory Latent Trait Measurement Error in Linear Mixed Effects Models

A New Perspective on the Multidimensionality of Divergent Thinking Tasks.

ITEM DIMENSIONALITY EXPLORATION BY MEANS OF CONSTRUCT MAP AND CATEGORICAL PRINCIPAL COMPONENTS ANALYSIS