Psychometric Issues Research Articles

AbstractBackgroundThe extent to which existing legacy memory tests are accurate and sensitive in detecting early cognitive decline is questionable. This reflects the wider picture in neurodegenerative conditions where clinical rating scales have not been developed for early‐stage disease. Item banks delivered through computer adaptive tests can help. But, in order to be fit‐for‐purpose, this approach requires a metrological framework with recourse to units, traceability, and interoperability. Here we describe the initial research to build such an item bank, based on legacy tests.MethodMemory tests (i.e., Corsi Block Test [CBT], Digit Span Test [DST], Rey’s Auditory Verbal Learning Test [RAVLT] and Word Learning List [WLL]) data were collected in the European EMPIR NeuroMET and the SMART cohorts recruited in Charité Hospital (Healthy control n 86; Subjective Cognitive Decline n 99; Mild Cognitive Impairment n 37; and Alzheimer’s Disease n 45). In order to align with metrological requirements, Rasch measurement theory in conjunction with construct specification equations were chosen to analyse the data.ResultBased on the combined dataset, when analyzed on their own, the data from the CBT, DST, RAVLT and WLL revealed skewness, gaps, and large measurement uncertainties. The addition of items from each of the tests into a bank improved these psychometric issues, improving reliability from a minimum of 0.65 to 0.85. The metrological legitimization of the ‘NeuroMET Memory Metric’ (formed from the combination of items) was confirmed through construct specification equations, which provided a Pearson correlation coefficient for empirical values vs. predicted (zR) values of up to 0.98 for task difficulty.ConclusionOur early promising findings provide a strong foundation for a metric of memory ability, which can be used as the basis for computer adaptive testing, better targeted measurement, and more accurate discrimination of early cognitive decline. Establishing units can also provide for the potential of developing crosswalks between a wider range of memory test items. Further work includes further longitudinal data collection to confirm item estimates, development of a digital platform, and a quality assurance program to establish traceability, and interoperability.

Innovation Concept: The outcome of emergency medicine training is to produce physicians who can competently run an emergency department (ED) shift. While many workplace-based ED assessments focus on discrete tasks of the discipline, others emphasize assessment of performance across the entire shift. However, the quality of assessments is generally poor and these tools often lack validity evidence. The use of entrustment scale anchors may help to address these psychometric issues. The aim of this study was to develop and gather validity evidence for a novel tool to assess a resident's ability to independently run an ED shift. Methods: Through a nominal group technique, local and national stakeholders identified dimensions of performance reflective of a competent ED physician. These dimensions were included in a new tool that was piloted in the Department of Emergency Medicine at the University of Ottawa during a 4-month period. Psychometric characteristics of the items were calculated, and a generalizability analysis used to determine the reliability of scores. An ANOVA was conducted to determine whether scores increased as a function of training level (junior = PGY1-2, intermediate = PGY3, senior = PGY4-5), and varied by ED treatment area. Safety for independent practice was analyzed with a dichotomous score. Curriculum, Tool or Material: The developed Ottawa Emergency Department Shift Observation Tool (O-EDShOT) includes 12-items rated on a 5-point entrustment scale with a global assessment item and 2 short-answer questions. Eight hundred and thirty-three assessment were completed by 78 physicians for 45 residents. Mean scores differed significantly by training level (p &lt; .001) with junior residents receiving lower ratings (3.48 ± 0.69) than intermediate residents who received lower ratings (3.98 ± 0.48) than senior residents (4.54 ± 0.42). Scores did not vary by ED treatment area (p &gt; .05). Residents judged to be safe to independently run the shift had significantly higher mean scores than those judged not to be safe (4.74 ± 0.31 vs 3.75 ± 0.66; p &lt; .001). Fourteen observations per resident, the typical number recorded during a 1-month rotation, were required to achieve a reliability of 0.80. Conclusion: The O-EDShOT successfully discriminated between junior, intermediate and senior-level residents regardless of ED treatment area. Multiple sources of evidence support the O-EDShOT producing valid scores for assessing a resident's ability to independently run an ED shift.

Psychometric Issues Research Articles

Related Topics

Articles published on Psychometric Issues

More than a memory test: A new metric linking blocks, numbers, and words

Clinical measures of communication limitations in dysarthria assessed through crowdsourcing: specificity, sensitivity, and retest-reliability

Psychometric properties, validity and insights of the School Bullying Questionnaire (CIE-A) in secondary schools of the Valencian Community (Spain).

Evaluation of the Psychometric Properties of the Burn Specific Health Scale-Brief: A National Institute on Disability, Independent Living, and Rehabilitation Research Burn Model System Study.

Five shades of gray: Conceptual and methodological issues around multivariate models of performance validity.

Efficacy of Virtual Reality Induced Environmental and Habitual Navigation on Psychological, Cognitive Function that Impacts on Physical Recovery in Patients with Stroke

Personality computing: New frontiers in personality assessment

Time to Disentangle the Information and Communication Technology (ICT) Constructs: Developing a Taxonomy around ICT Use for Occupational Health Research.

Research on parental burnout across cultures: Steps toward global understanding.

Emory University School of Medicine.

Situational Judgment Tests: An Overview of Development Practices and Psychometric Characteristics

Measuring Teacher Sense of Efficacy: Insights and Recommendations Concerning Scale Design and Data Analysis from Research with Preservice and Inservice Teachers in China

Unraveling the genetic architecture of major depressive disorder: merits and pitfalls of the approaches used in genome-wide association studies.

Modeling of Item Response Functions Under the D-Scoring Method.

The Vancouver Index of Acculturation (VIA): New evidence on dimensionality and measurement invariance across two cultural settings

LO84: Ready to run the show: development of a new instrument for assessing resident competence in the emergency department

Paradoxes and Dual Processes: A Review and Synthesis

Preliminary psychometric analyses of the International Spinal Cord Injury Quality of Life Basic Data Set.

Rash Model Analysis of Kaufman Domains of Creativity Scale (K-DOCS) to Indonesian Students

About carts, horses, and other psychometric issues

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Psychometric Issues Research Articles

Related Topics

Articles published on Psychometric Issues

More than a memory test: A new metric linking blocks, numbers, and words

Clinical measures of communication limitations in dysarthria assessed through crowdsourcing: specificity, sensitivity, and retest-reliability

Psychometric properties, validity and insights of the School Bullying Questionnaire (CIE-A) in secondary schools of the Valencian Community (Spain).

Evaluation of the Psychometric Properties of the Burn Specific Health Scale-Brief: A National Institute on Disability, Independent Living, and Rehabilitation Research Burn Model System Study.

Five shades of gray: Conceptual and methodological issues around multivariate models of performance validity.

Efficacy of Virtual Reality Induced Environmental and Habitual Navigation on Psychological, Cognitive Function that Impacts on Physical Recovery in Patients with Stroke

Personality computing: New frontiers in personality assessment

Time to Disentangle the Information and Communication Technology (ICT) Constructs: Developing a Taxonomy around ICT Use for Occupational Health Research.

Research on parental burnout across cultures: Steps toward global understanding.

Emory University School of Medicine.

Situational Judgment Tests: An Overview of Development Practices and Psychometric Characteristics

Measuring Teacher Sense of Efficacy: Insights and Recommendations Concerning Scale Design and Data Analysis from Research with Preservice and Inservice Teachers in China

Unraveling the genetic architecture of major depressive disorder: merits and pitfalls of the approaches used in genome-wide association studies.

Modeling of Item Response Functions Under the D-Scoring Method.

The Vancouver Index of Acculturation (VIA): New evidence on dimensionality and measurement invariance across two cultural settings

LO84: Ready to run the show: development of a new instrument for assessing resident competence in the emergency department

Paradoxes and Dual Processes: A Review and Synthesis

Preliminary psychometric analyses of the International Spinal Cord Injury Quality of Life Basic Data Set.

Rash Model Analysis of Kaufman Domains of Creativity Scale (K-DOCS) to Indonesian Students

About carts, horses, and other psychometric issues