Test Assembly Research Articles

BackgroundAutomated test assembly (ATA) represents a modern methodology that employs data science optimization on computer platforms to automatically create test form, thereby significantly improving the efficiency and accuracy of test assembly procedures. In the realm of medical education, large-scale high-stakes assessments often necessitate lengthy tests, leading to elevated costs in various dimensions (such as examinee fatigue and expenses associated with item development). This study aims to augment the design of the medical education assessments by leveraging modern ATA approaches.MethodsTo achieve the objective, a four-step process employing psychometric methodologies was used to calibrate and analyze the item pool of the Standardized Competence Test for Clinical Medicine Undergraduates (SCTCMU), a nationwide summative test comprising 300 multiple-choice questions (MCQ) in China. Subsequently, two modern ATA approaches were employed to determine the optimal item combination, accounting for both statistical and content requirements specified in the test blueprint. The qualities of the assembled test form, generated using modern ATA approaches, underwent meticulous evaluation.ResultsThrough an exploration of the psychometric properties of the SCTCMU as a foundational step, the evaluation revealed commendable quality in the item properties. Furthermore, the evaluation of the quality of assembled test form using modern ATA approaches indicated the ability to ascertain the optimal test length within the predefined measurement precision. Specifically, this investigation demonstrates that the application of modern ATA approaches can substantially reduce the test length of assembled test form, while simultaneously maintaining the required statistical and content standards specified in the test blueprint.ConclusionsThis study harnessed modern ATA approaches to facilitate the automatic construction of test form, thereby significantly enhancing the efficiency and precision of test assembly procedures. The utilization of modern ATA approaches offers medical educators a valuable tool to enhance the efficiency and cost-effectiveness of medical education assessment.

Read full abstract

The National Board of Osteopathic Medical Examiners (NBOME) administers the Comprehensive Osteopathic Medical Licensing Examination of the United States (COMLEX-USA), a three-level examination designed for licensure for the practice of osteopathic medicine. The examination design for COMLEX-USA Level 3 (L3) was changed in September 2018 to a two-day computer-based examination with two components: a multiple-choice question (MCQ) component with single best answer and a clinical decision-making (CDM) case component with extended multiple-choice (EMC) and short answer (SA) questions. Continued validation of the L3 examination, especially with the new design, is essential for the appropriate interpretation and use of the test scores. The purpose of this study is to gather evidence to support the validity of the L3 examination scores under the new design utilizing sources of evidence based on Kane's validity framework. Kane's validity framework contains four components of evidence to support the validity argument: Scoring, Generalization, Extrapolation, and Implication/Decision. In this study, we gathered data from various sources and conducted analyses to provide evidence that the L3 examination is validly measuring what it is supposed to measure. These include reviewing content coverage of the L3 examination, documenting scoring and reporting processes, estimating the reliability and decision accuracy/consistency of the scores, quantifying associations between the scores from the MCQ and CDM components and between scores from different competency domains of the L3 examination, exploring the relationships between L3 scores and scores from a performance-based assessment that measures related constructs, performing subgroup comparisons, and describing and justifying the criterion-referenced standard setting process. The analysis data contains first-attempt testscores for 8,366 candidates who took the L3 examination between September 2018 and December 2019. The performance-based assessment utilized as a criterion measure in this study is COMLEX-USA Level 2 Performance Evaluation (L2-PE). All assessment forms were built through the automated test assembly (ATA) procedure to maximize parallelism in terms of content coverage and statistical properties across the forms. Scoring and reporting follows industry-standard quality-control procedures. The inter-rater reliability of SA rating, decision accuracy, and decision consistency for pass/fail classifications are all very high. There is a statistically significant positive association between the MCQ and the CDM components of the L3 examination. The patterns of associations, both within the L3 subscores and with L2-PE domain scores, fit with what is being measured. The subgroup comparisons by gender, race, and first language showed expected small differences in mean scores between the subgroups within each category and yielded findings that are consistent with those described in the literature. The L3 pass/fail standard was established through implementation of a defensible criterion-referenced procedure. This study provides some additional validity evidence for the L3 examination based on Kane's validity framework. The validity of any measurement must be established through ongoing evaluation of the related evidence. The NBOME will continue to collect evidence to support validity arguments for the COMLEX-USA examination series.

Read full abstract

Test Assembly Research Articles

Related Topics

Articles published on Test Assembly

Study on fuel management strategy for accident-tolerant fuel on in-pile irradiation test

Empirical Analysis of Physics Test Instruments to Measure Graphical Representation Abilities in “Temperature and Heat” Topics

Optimizing a national examination for medical undergraduates via modern automated test assembly approaches

In vitro study to analyze reverse torque values of attachment screws with multiple loosening/tightening preload cycles.

Computational thermal fluid enabled multi-physics transport processes analysis of ceramic hollow fiber membrane for oxygen separation

Experimental, analytical, and numerical quantification of the Marangoni effect in static refractory finger test

Upsampling Monte Carlo Reactor Simulation Tallies in Depleted Sodium-Cooled Fast Reactor Assemblies Using a Convolutional Neural Network

Development of Learning Outcome Test Instruments on Thermodynamics and Waves Based on The Demands of Basic Competencies

A validity study of COMLEX-USA Level 3 with the new test design.

Development of a new acoustic prediction tool by integration of life cycle assessment

Investigation of a multistage adaptive test based on test assembly methods

Ant colony optimization for parallel test assembly.

Upsampling Monte Carlo reactor simulation tallies in depleted LWR assemblies fueled with LEU and HALEU using a convolutional neural network

Machine Learning-Driven Maintenance Order Generation in Assembly Lines

Exploring Multiple-Objective Optimization for Efficient and Effective Test Paper Design with Dynamic Programming Guided Genetic Algorithm.

Item exposure and utilization control methods for optimal test assembly

Validation of GAMMA+ code for SFR application using FFTF loss-of-flow-without-scram test results

HLA-G 3'UTR haplotype analyses in HCV infection and HCV-derived cirrhosis, hepatocellular carcinoma and fibrosis.

Effect of Differently Shaped Solder Joints of Chip Resistor on Fatigue Life

Action observation plus motor imagery and somatosensory discrimination training are effective non-motor approaches to improve manual dexterity

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Test Assembly Research Articles

Related Topics

Articles published on Test Assembly

Study on fuel management strategy for accident-tolerant fuel on in-pile irradiation test

Empirical Analysis of Physics Test Instruments to Measure Graphical Representation Abilities in “Temperature and Heat” Topics

Optimizing a national examination for medical undergraduates via modern automated test assembly approaches

In vitro study to analyze reverse torque values of attachment screws with multiple loosening/tightening preload cycles.

Computational thermal fluid enabled multi-physics transport processes analysis of ceramic hollow fiber membrane for oxygen separation

Experimental, analytical, and numerical quantification of the Marangoni effect in static refractory finger test

Upsampling Monte Carlo Reactor Simulation Tallies in Depleted Sodium-Cooled Fast Reactor Assemblies Using a Convolutional Neural Network

Development of Learning Outcome Test Instruments on Thermodynamics and Waves Based on The Demands of Basic Competencies

A validity study of COMLEX-USA Level 3 with the new test design.

Development of a new acoustic prediction tool by integration of life cycle assessment

Investigation of a multistage adaptive test based on test assembly methods

Ant colony optimization for parallel test assembly.

Upsampling Monte Carlo reactor simulation tallies in depleted LWR assemblies fueled with LEU and HALEU using a convolutional neural network

Machine Learning-Driven Maintenance Order Generation in Assembly Lines

Exploring Multiple-Objective Optimization for Efficient and Effective Test Paper Design with Dynamic Programming Guided Genetic Algorithm.

Item exposure and utilization control methods for optimal test assembly

Validation of GAMMA+ code for SFR application using FFTF loss-of-flow-without-scram test results

HLA-G 3'UTR haplotype analyses in HCV infection and HCV-derived cirrhosis, hepatocellular carcinoma and fibrosis.

Effect of Differently Shaped Solder Joints of Chip Resistor on Fatigue Life

Action observation plus motor imagery and somatosensory discrimination training are effective non-motor approaches to improve manual dexterity