High Assurance Research Articles

The growing body of working dog literature includes many examples of scales robustly developed to measure aspects of dog behavior. However, when comparing behavior to working dog ability, most studies rely on training organizations' own long-established ratings of performance, or simply pass/fail at selection or certification as measures of success. Working ability is multifaceted, and it is likely that different aspects of ability are differentially affected by external factors. In order to understand how specific aspects of selection, training, and operations influence a dog's working ability, numerous facets of performance should be considered. An accurate and validated method for quantifying multiple aspects of performance is therefore required. Here, we describe the first stages of formulating a meaningful performance measurement tool for two types of working search dogs. The systematic methodology used was: (1) interviews and workshops with a representative cross-section of stakeholders to produce a shortlist of behaviors integral to current operational performance of vehicle (VS) and high assurance (HAS) search dogs; (2) assessing the reliability and construct validity of the shortlisted behavioral measures (at the behavior and the individual rater level) using ratings of diverse videoed searches by experienced personnel; and (3) selecting the most essential and meaningful behaviors based on their reliability/validity and importance. The resulting performance measurement tool was composed of 12 shortlisted behaviors, most of which proved reliable and valid when assessed by a group of raters. At the individual rater level, however, there was variability between raters in the ability to use and interpret behavioral measures, in particular, more abstract behaviors such as Independence. This illustrates the importance of examining individual rater scores rather than extrapolating from group consensus (as is often done), especially when designing a tool that will ultimately be used by single raters. For ratings to be practically valuable, individual rater reliability needs to be improved, especially for behaviors deemed as essential (e.g., control and confidence). We suggest that the next steps are to investigate why individuals vary in their ratings and to undertake efforts to increase the likelihood that they reach a common conceptualization of each behavioral construct. Plausible approaches are improving the format in which behaviors are presented, e.g., by adding benchmarks and utilizing rater training.

Read full abstract

Rating scales are widely used to rate working dog behavior and performance. Whilst behaviour scales have been extensively validated, instruments used to rate ability have usually been designed by training and practitioner organizations, and often little consideration has been given to how seemingly insignificant aspects of the scale design might alter the validity of the results obtained. Here we illustrate how manipulating one aspect of rating scale design, the provision of verbal benchmarks or labels (as opposed to just a numerical scale), can affect the ability of observers to distinguish between differing levels of search dog performance in an operational environment. Previous studies have found evidence for range restriction (using only part of the scale) in raters' use of the scales and variability between raters in their understanding of the traits used to measures performance. As provision of verbal benchmarks has been shown to help raters in a variety of disciplines to select appropriate scale categories (or scores), it may be predicted that inclusion of verbal benchmarks will bring raters' conceptualization of the traits closer together, increasing agreement between raters, as well as improving the ability of observers to distinguish between differing levels of search dog performance and reduce range restriction. To test the value of verbal benchmarking we compared inter-rater reliability, raters' ability to discriminate between different levels of search dog performance, and their use of the whole scale before and after being presented with benchmarked scales for the same traits. Raters scored the performance of two separate types of explosives search dog (High Assurance Search (HAS) and Vehicle Search (VS) dogs), from short (~30 s) video clips, using 11 previously validated traits. Taking each trait in turn, for the first five clips raters were asked to give a score from 1, representing the lowest amount of the trait evident to 5, representing the highest. Raters were given a list of adjective-based benchmarks (e.g., very low, low, intermediate, high, very high) and scored a further five clips for each trait. For certain traits, the reliability of scoring improved when benchmarks were provided (e.g., Motivation and Independence), indicating that their inclusion may potentially reduce ambivalence in scoring, ambiguity of meanings, and cognitive difficulty for raters. However, this effect was not universal, with the ratings of some traits remaining unchanged (e.g., Control), or even reducing in reliability (e.g., Distraction). There were also some differences between VS and HAS (e.g., Confidence reliability increased for VS raters and decreased for HAS raters). There were few improvements in the spread of scores across the range, but some indication of more favorable scoring. This was a small study of operational handlers and trainers utilizing training video footage from realistic operational environments, and there are potential cofounding effects. We discuss possible causal factors, including issues specific to raters and possible deficiencies in the chosen benchmarks, and suggest ways to further improve the effectiveness of rating scales. This study illustrates why it is vitally important to validate all aspects of rating scale design, even if they may seem inconsequential, as relatively small changes to the amount and type of information provided to raters can have both positive and negative impacts on the data obtained.

Read full abstract

High Assurance Research Articles

Related Topics

Articles published on High Assurance

VTrust: Remotely Executing Mobile Apps Transparently With Local Untrusted OS

The Use of Digital Technologies for the Purpose of Improving Methodological Approaches to the Creation of a Pharmaceutical Quality System at Enterprises for the Production of Medicines

A Verified Formal Specification of A Secured Communication Method For Smart Card Applications

Assurance quality, disclosed connectivity of the capitals and information asymmetry – An interaction analysis for the case of integrated reporting

A Framework for Model and Verification of Safety-Critical Operating System Based on ARINC653

Development of a Performance Monitoring Instrument for Rating Explosives Search Dog Performance

A goal‐driven approach for the joint deployment of safety and security standards for operators of essential services

Role-Based Profiling Using Fuzzy Adaptive Resonance Theory for Securing Database Systems

RFID Tracking Implementation for Supplier Chain Management at Toyota USA: Proposal of Development of Advanced TPS for Global Production Strategy

Metacognitive Ability and Academic Self-Efficacy: Their Relations to Role Transition as Perceived by Nursing students

Does Benchmarking of Rating Scales Improve Ratings of Search Performance Given by Specialist Search Dog Handlers?

Koord: a language for programming and verifying distributed robotics application

Formalization of Camera Pose Estimation Algorithm based on Rodrigues Formula

The geoforensic search strategy: A high assurance search method to assist law enforcement locate graves and contraband associated with homicide, counter terrorism and serious and organised crime

Making Teaching Relevant: Enhancing Students’ Self-Efficacy Through Teachers’ Enthusiasm for More Active Classroom Engagement

New Operational Availability Model to Evaluate Manufacturing Throughput: Advanced TPS for Global Production

Provenance-enabled packet path tracing in the RPL-based internet of things

Views of corporate managers on assurance of sustainability reporting: evidence from Japan

Assuring Clonality on the Beacon Digital Cell Line Development Platform.

P188 QUALITY ASSURANCE OF SURGICAL INTERVENTION WITHIN RANDOMIZED CONTROLLED TRIALS FOR GASTRO-ESOPHAGEAL REFLUX DISEASE

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

High Assurance Research Articles

Related Topics

Articles published on High Assurance

VTrust: Remotely Executing Mobile Apps Transparently With Local Untrusted OS

The Use of Digital Technologies for the Purpose of Improving Methodological Approaches to the Creation of a Pharmaceutical Quality System at Enterprises for the Production of Medicines

A Verified Formal Specification of A Secured Communication Method For Smart Card Applications

Assurance quality, disclosed connectivity of the capitals and information asymmetry – An interaction analysis for the case of integrated reporting

A Framework for Model and Verification of Safety-Critical Operating System Based on ARINC653

Development of a Performance Monitoring Instrument for Rating Explosives Search Dog Performance

A goal‐driven approach for the joint deployment of safety and security standards for operators of essential services

Role-Based Profiling Using Fuzzy Adaptive Resonance Theory for Securing Database Systems

RFID Tracking Implementation for Supplier Chain Management at Toyota USA: Proposal of Development of Advanced TPS for Global Production Strategy

Metacognitive Ability and Academic Self-Efficacy: Their Relations to Role Transition as Perceived by Nursing students

Does Benchmarking of Rating Scales Improve Ratings of Search Performance Given by Specialist Search Dog Handlers?

Koord: a language for programming and verifying distributed robotics application

Formalization of Camera Pose Estimation Algorithm based on Rodrigues Formula

The geoforensic search strategy: A high assurance search method to assist law enforcement locate graves and contraband associated with homicide, counter terrorism and serious and organised crime

Making Teaching Relevant: Enhancing Students’ Self-Efficacy Through Teachers’ Enthusiasm for More Active Classroom Engagement

New Operational Availability Model to Evaluate Manufacturing Throughput: Advanced TPS for Global Production

Provenance-enabled packet path tracing in the RPL-based internet of things

Views of corporate managers on assurance of sustainability reporting: evidence from Japan

Assuring Clonality on the Beacon Digital Cell Line Development Platform.

P188 QUALITY ASSURANCE OF SURGICAL INTERVENTION WITHIN RANDOMIZED CONTROLLED TRIALS FOR GASTRO-ESOPHAGEAL REFLUX DISEASE