Abstract

Good test data is crucial for driving new developments in computer vision (CV), but two questions remain unanswered: which situations should be covered by the test data, and how much testing is enough to reach a conclusion? In this paper we propose a new answer to these questions using a standard procedure devised by the safety community to validate complex systems: the hazard and operability analysis (HAZOP). It is designed to systematically identify possible causes of system failure or performance loss. We introduce a generic CV model that creates the basis for the hazard analysis and—for the first time—apply an extensive HAZOP to the CV domain. The result is a publicly available checklist with more than 900 identified individual hazards. This checklist can be utilized to evaluate existing test datasets by quantifying the covered hazards. We evaluate our approach by first analyzing and annotating the popular stereo vision test datasets Middlebury and KITTI. Second, we demonstrate a clearly negative influence of the hazards in the checklist on the performance of six popular stereo matching algorithms. The presented approach is a useful tool to evaluate and improve test datasets and creates a common basis for future dataset designs.
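
To make the quantification step concrete, the following minimal Python sketch computes how many checklist hazards a set of annotated test images covers. The hazard IDs and the per-image annotation format are invented for illustration and do not reflect the actual CV-HAZOP schema.

    # Minimal sketch: quantify how many checklist hazards a test dataset
    # covers. Hazard IDs and the annotation format are illustrative, not
    # the actual CV-HAZOP schema.
    checklist = {"H001", "H002", "H003", "H004"}   # all hazards in the checklist

    # Per-image annotations: hazards an expert identified in each test image.
    annotations = {
        "img_0001.png": {"H001", "H003"},
        "img_0002.png": {"H003"},
        "img_0003.png": set(),                     # no identified hazard
    }

    covered = set().union(*annotations.values())
    coverage = len(covered & checklist) / len(checklist)
    print(f"covered hazards: {sorted(covered)}")
    print(f"checklist coverage: {coverage:.0%}")   # 50% in this toy example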

Highlights

  • Many safety-critical systems depend on computer vision (CV) technologies to navigate or manipulate their environment and require a thorough safety assessment due to the evident risk to human lives (Matthias et al. 2010)

  • We show that the CV hazard and operability analysis (CV-HAZOP) correctly identifies challenging situations, and we provide a guideline for researchers to perform their own analysis of test data; a sketch of the underlying HAZOP enumeration follows these highlights

  • The goal is to show that the entries of the CV-HAZOP are meaningful and that the checklist is a useful tool for evaluating the robustness of CV algorithms
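
At its core, a HAZOP is a systematic enumeration: standard guide words (deviations such as "No", "More", "Less") are combined with the parameters of each location in a model of the system, and each meaningful combination is interpreted as a candidate hazard. The Python sketch below illustrates that enumeration; the locations and parameters are invented examples, not the paper's actual generic CV model.

    from itertools import product

    # Standard HAZOP guide words; the paper adapts such a set to the CV domain.
    GUIDE_WORDS = ["No", "More", "Less", "As well as",
                   "Part of", "Reverse", "Other than"]

    # Illustrative locations and parameters of a generic CV model; the
    # actual CV-HAZOP model defines its own, far more extensive lists.
    LOCATIONS = {
        "Light sources": ["intensity", "position"],
        "Medium": ["transparency"],
        "Observer (optics)": ["focus"],
    }

    # Enumerate candidate hazards; each combination gets a unique hazard ID.
    hid = 0
    for location, parameters in LOCATIONS.items():
        for parameter, guide_word in product(parameters, GUIDE_WORDS):
            hid += 1
            print(f"HID {hid:03d}: {location} | {parameter} | {guide_word}")
            # Domain experts then interpret each combination: meaning,
            # consequences, and example situations are filled in manually.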

Summary

Introduction

Many safety-critical systems depend on CV technologies to navigate or manipulate their environment and require a thorough safety assessment due to the evident risk to human lives (Matthias et al. 2010). This work presents a new way to facilitate such a safety assessment: a standard method developed by the safety community is applied to the CV domain for the first time. A major obstacle when validating CV algorithms is the enormous set of possible test images. Validation tries to show that the algorithm can reliably solve the task at hand, even under difficult conditions. Validation and benchmarking both use application-specific datasets, but their goals differ, and benchmarking sets are not suited for validation. The main challenge for validation in CV is listing elements and relations which are known to be “difficult” for CV algorithms (comparable to optical illusions for humans). The impact of the identified hazards on the output of multiple stereo vision algorithms is compared in the Evaluation section.
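
The comparison rests on a simple principle: partition the test data into frames that contain an identified hazard and frames that do not, then compare algorithm performance between the two partitions. The sketch below illustrates this with invented numbers; the error values and the metric are placeholders, not results from the paper.

    import statistics

    # Illustrative per-frame disparity errors of one stereo matcher, split
    # by whether annotators marked an identified hazard in the frame. The
    # numbers are invented; the error metric is a stand-in.
    errors_with_hazard = [2.9, 3.4, 4.1, 3.8]
    errors_without_hazard = [1.1, 0.9, 1.4, 1.2]

    mean_hazard = statistics.mean(errors_with_hazard)
    mean_clean = statistics.mean(errors_without_hazard)

    print(f"mean error with hazards:    {mean_hazard:.2f}")
    print(f"mean error without hazards: {mean_clean:.2f}")
    print(f"difference attributable to hazards: {mean_hazard - mean_clean:.2f}")
    # A non-parametric test (e.g., Mann-Whitney U) can then assess whether
    # the difference is statistically significant.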

Related Work
  Risk Analysis
  Robustness
CV-HAZOP
  Generic Model
  Guide Words
  Locations
  Parameters
  Implementation
  Execution
Application
Evaluation
  Performance Evaluation
  Interpretation
  Statistical Significance
Conclusion
  Outlook
