Abstract

This article examines the effectiveness of artificial neural networks (ANNs) as a forensic voice comparison technique. The study specifically considers feed-forward multilayer perceptron (MLP) and radial basis function (RBF) network models. Formant frequencies of the Polish vowel e (stressed or unstressed) in selected contexts were used as predictors. This choice follows an earlier investigation, which found that dynamic formant frequencies of vowels are powerful features for distinguishing voices. The results indicate that neural networks can help distinguish one speaker from others with very high accuracy, reaching 100%. MLP models should be given preference. The results also show that the choice of vowel e triad influences the correct classification rate. In addition, classification accuracy is higher when based on a single context than when similar input data are aggregated over several different contexts.
