Human error rates for speaker recognition

Wade Shen,Reva Schwartz,Joseph P Campbell

doi:10.1121/1.3655182

Abstract

It is commonly assumed that speaker identification by human listeners is an innate skill under certain conditions. As such, human listening tests have served as the benchmark for automatic recognition systems. In recent evaluations comparing human and machine performance on a speaker comparison task, error rates of naïve human listeners far exceed those of machines [special session on Human Assisted Speaker Recognition, IEEE ICASSP, Prague, 2011]. In this presentation, we quantify the performance of naïve listeners in a variety of challenging channel conditions and we compare these results against automatic systems and trained human listeners. The results of these experiments impact the admissibility of both forensic voice analysis and courtroom testimony by human listeners.

Full Text