Abstract

State of the art automatic speaker recognition systems show very good results in the discrimination between different speakers under controlled recording conditions. In a forensic context, the conditions are uncontrolled and voice can be disguised. In cases of terrorism claim, extortion or kidnapping, it is of great interest for offenders to conceal their identity. Voice disguise is an important constraint to speaker discrimination. Some disguises produce a great variation of parameters and change the perception of an identity. The main risk is to confound a disguised voice and a normal voice and accuse an innocent individual. This paper proposes on one hand to present the impact of voice disguise on automatic speaker recognition and, on the other hand a statistical study in order to detect and identify four disguises among the most common. The first step consists in extracting features and the second step to classify them. MFCC (Mel Frequency Cepstral Coefficient) are considered as features and different classification algorithms have been tested. The studied disguises are based on a deliberated and non electronic way. The proposed analysis of disguised voice classification provides interesting results in detection by the use of SVM (Support Vector Machine) and in identification by the use of GMM (Gaussian mixture models).

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.