Low-stakes Tests Research Articles

In the domain of self-assessment, researchers have begun to draw distinctions between summative self-assessment activities (i.e., making an overall judgment of one's ability in a particular domain) and self-monitoring processes (i.e., an "in the moment" awareness of whether one has the necessary knowledge or skills to address a specific problem with which one is faced). Indeed, previous research has shown that, when responding to both short answer and multiple choice questions, individuals are able to assess the likelihood of answering questions correctly on a moment-by-moment basis, even though they are not able to generate an accurate self-assessment of overall performance on the test. These studies, however, were conducted in the context of low-stakes tests of general "trivia". The purpose of the present study was to further this line of research by investigating the relationship between self-monitoring and performance in the context of a high stakes test assessing medical knowledge. Using a recent administration of the Medical Council of Canada Qualifying Examination Part I, we examined three measures intended to capture self-monitoring: (1) the time taken to respond to each question, (2) the number of questions a candidate flagged as needing to be considered further, and (3) the likelihood of changing one's initial answer. Differences in these measures as a function of the accuracy of the candidate's response were treated as indices of each candidate's ability to judge his or her likelihood of responding correctly. The three self-monitoring indices were compared for candidates at three different levels of overall performance on the exam. Relative to correct responses, when examinees initially responded incorrectly, they spent more time answering the question, were more likely to flag the question for future consideration, and were more likely to change their answer before committing to a final answer. These measures of self-monitoring were modulated by candidate performance in that high performing examinees showed greater differences on these indices relative to poor performing examinees. Furthermore, reliability analyses suggest that these difference measures hold promise for reliably differentiating self-monitoring at the level of individuals, at least within a given content area. The results suggest that examinees were self-monitoring their knowledge and skills on a question by question basis and altering their behavior appropriately in the moment. High performing individuals showed stronger evidence of accurate self-monitoring than did low performing individuals and the reliability of these measures suggests that they have the potential to differentiate between individuals. How these findings relate to performance in actual clinical settings remains to be seen.

Read full abstract

Teacher conceptions of assessment are influential mediators of how assessment policy initiatives are implemented in schools. Four hierarchical, intercorrelated factors (i.e., assessment for improvement, school accountability, and student accountability, and assessment as irrelevant) of how teachers' conceive of assessment have been reported. However, most studies have been conducted only in English in jurisdictions with policies of low-stakes testing. This paper extends the research by surveying 249 Greek-Cypriot teachers with a Greek translation of the Teachers' Conceptions of Assessment inventory. Cyprus has a relatively low-stakes assessment policy during the compulsory school years, suggesting, under the assumption of ecological rationality, that conceptions would be similar to previous English-language studies. Exploratory factor analysis of the Cyprus data led to a five-factor solution with 24 items within two inversely correlated second-order factors (i.e., assessment is positive and negative; r = −0.49). A multigroup nested invariance confirmatory factor analysis found statistical invariance between the Cyprus and the New Zealand data. Mean score differences were small for two improvement-oriented conceptions, moderate for assessment that is bad, and large for school accountability and ignoring assessment factors. Similarities and differences in conceptions appear to reflect commonalities and discrepancies in educational system policies and practices.

Read full abstract

Low-stakes Tests Research Articles

Related Topics

Articles published on Low-stakes Tests

An Investigation of Examinee Test-Taking Effort on a Large-Scale Assessment

Management Insights

Working When No One Is Watching: Motivation, Test Scores, and Economic Success

Achievement Goal Orientation and Situational Motivation for a Low-Stakes Test of Content Knowledge

Statewide low-stakes tests and a teaching to the test effect? An analysis of teacher survey data from two German states

Time on Test, Student Motivation, and Performance on the Collegiate Learning Assessment:

Time on Test, Student Motivation, and Performance on the Collegiate Learning Assessment:

The Validation Process in Developing a Web-based English Speaking and Writing Test

Low-Stakes Testing and Psychological Reactance: Using the Hong Psychological Reactance Scale to Better Understand Compliant and Non-Compliant Examinees

Strategies to Motivate Students for Program Assessment

Self-monitoring and its relationship to medical knowledge

Test-enhanced learning in a middle school science classroom: The effects of quiz frequency and placement.

Pharmacy Students' Test-Taking Motivation-Effort on a Low-Stakes Standardized Test

Rise to the Challenge or Not Give a Damn: Differential Performance in High vs. Low Stakes Tests

Ecological rationality in teachers' conceptions of assessment across samples from Cyprus and New Zealand

Do Examinees Have Similar Test-Taking Effort? A High-Stakes Question for Low-Stakes Testing

Can Differential Rapid-Guessing Behavior Lead to Differential Item Functioning?

Examinee Noneffort and the Validity of Program Assessment Results

The relationship between motivation and achievement in low-stakes examinations

Correlates of Rapid-Guessing Behavior in Low-Stakes Testing: Implications for Test Development and Measurement Practice

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Low-stakes Tests Research Articles

Related Topics

Articles published on Low-stakes Tests

An Investigation of Examinee Test-Taking Effort on a Large-Scale Assessment

Management Insights

Working When No One Is Watching: Motivation, Test Scores, and Economic Success

Achievement Goal Orientation and Situational Motivation for a Low-Stakes Test of Content Knowledge

Statewide low-stakes tests and a teaching to the test effect? An analysis of teacher survey data from two German states

Time on Test, Student Motivation, and Performance on the Collegiate Learning Assessment:

Time on Test, Student Motivation, and Performance on the Collegiate Learning Assessment:

The Validation Process in Developing a Web-based English Speaking and Writing Test

Low-Stakes Testing and Psychological Reactance: Using the Hong Psychological Reactance Scale to Better Understand Compliant and Non-Compliant Examinees

Strategies to Motivate Students for Program Assessment

Self-monitoring and its relationship to medical knowledge

Test-enhanced learning in a middle school science classroom: The effects of quiz frequency and placement.

Pharmacy Students' Test-Taking Motivation-Effort on a Low-Stakes Standardized Test

Rise to the Challenge or Not Give a Damn: Differential Performance in High vs. Low Stakes Tests

Ecological rationality in teachers' conceptions of assessment across samples from Cyprus and New Zealand

Do Examinees Have Similar Test-Taking Effort? A High-Stakes Question for Low-Stakes Testing

Can Differential Rapid-Guessing Behavior Lead to Differential Item Functioning?

Examinee Noneffort and the Validity of Program Assessment Results

The relationship between motivation and achievement in low-stakes examinations

Correlates of Rapid-Guessing Behavior in Low-Stakes Testing: Implications for Test Development and Measurement Practice