The question of disguised voice

Patrick Perrot,Gerard Chollet

doi:10.1121/1.2935782

Abstract

State of the art automatic speaker recognition systems show very good results in the discrimination between different speakers under controlled recording conditions. In a forensic context, the conditions are uncontrolled and voice can be disguised. In cases of terrorism claim, extortion or kidnapping, it is of great interest for offenders to conceal their identity. Voice disguise is an important constraint to speaker discrimination. Some disguises produce a great variation of parameters and change the perception of an identity. The main risk is to confound a disguised voice and a normal voice and accuse an innocent individual. This paper proposes on one hand to present the impact of voice disguise on automatic speaker recognition and, on the other hand a statistical study in order to detect and identify four disguises among the most common. The first step consists in extracting features and the second step to classify them. MFCC (Mel Frequency Cepstral Coefficient) are considered as features and different classification algorithms have been tested. The studied disguises are based on a deliberated and non electronic way. The proposed analysis of disguised voice classification provides interesting results in detection by the use of SVM (Support Vector Machine) and in identification by the use of GMM (Gaussian mixture models).

Full Text