Forensic voice comparison by means of artificial neural networks

Kinga Sałapa,Agata Trawińska,Irena Roterman-Konieczna

doi:10.1515/bams-2013-0153

Abstract

Abstract This article examines the effectiveness of artificial neural networks (ANNs) as forensic voice comparison techniques. This study specifically considers feed-forward multilayer perceptron (MLP) and radial basic function (RBF) network models. Formant frequencies of Polish vowel e (stressed or unstressed) in selected contexts were used as predictors. This has already been confirmed in an earlier investigation that determined that dynamic formant frequencies of vowels are powerful elements in distinguishing the voice. It has been concluded that neural networks might assist in distinguishing speakers from the others with very good accuracy, reaching 100%. MLP models should be given preference. The results of the investigation have shown the influence of vowel e triads on the effectiveness of correct classification rates. In addition, the authors have determined that the accuracy of classification is greater when based on a single context than for similar input data aggregated over several different contexts.

Full Text