Abstract

It is commonly assumed that speaker identification by human listeners is an innate skill under certain conditions. As such, human listening tests have served as the benchmark for automatic recognition systems. In recent evaluations comparing human and machine performance on a speaker comparison task, error rates of naïve human listeners far exceed those of machines [special session on Human Assisted Speaker Recognition, IEEE ICASSP, Prague, 2011]. In this presentation, we quantify the performance of naïve listeners in a variety of challenging channel conditions and we compare these results against automatic systems and trained human listeners. The results of these experiments impact the admissibility of both forensic voice analysis and courtroom testimony by human listeners.

Full Text
Published version (Free)

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call