Do long-term acoustic-phonetic features and mel-frequency cepstral coefficients provide complementary speaker-specific information for forensic voice comparison?

Ricky K.W Chan,Bruce X Wang

doi:10.1016/j.forsciint.2024.112199

Abstract

A growing number of studies in forensic voice comparison have explored how elements of phonetic analysis and automatic speaker recognition systems may be integrated for optimal speaker discrimination performance. However, few studies have investigated the evidential value of long-term speech features using forensically-relevant speech data. This paper reports an empirical validation study that assesses the evidential strength of the following long-term features: fundamental frequency (F0), formant distributions, laryngeal voice quality, mel-frequency cepstral coefficients (MFCCs), and combinations thereof. Non-contemporaneous recordings with speech style mismatch from 75 male Australian English speakers were analyzed. Results show that 1) MFCCs outperform long-term acoustic phonetic features; 2) source and filter features do not provide considerably complementary speaker-specific information; and 3) the addition of long-term phonetic features to an MFCCs-based system does not lead to meaningful improvement in system performance. Implications for the complementarity of phonetic analysis and automatic speaker recognition systems are discussed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Do long-term acoustic-phonetic features and mel-frequency cepstral coefficients provide complementary speaker-specific information for forensic voice comparison?

Abstract

Talk to us

Similar Papers

More From: Forensic Science International

Lead the way for us

Similar Papers

Phonetic content impact on Forensic Voice Comparison
Ajili Moez ... Kahn Juliette
-
Ajili Moez, et. al.Ajili Moez ... Kahn Juliette
01 Dec 2016
01 Dec 2016

Speaker recognition utilizing distributed DCT-II based Mel frequency cepstral coefficients and fuzzy vector quantization
M Afzal Hossan ... Mark A Gregory
International Journal of Speech Technology | VOL. 16
M Afzal Hossan, et. al.M Afzal Hossan ... Mark A Gregory
28 Jun 2012
International Journal of Speech Technology | VOL. 16

Exploring the relationship between voice similarity estimates by listeners and by an automatic speaker recognition system incorporating phonetic features
Linda Gerlach ... Francis Nolan
Speech Communication | VOL. 124
Linda Gerlach, et. al.Linda Gerlach ... Francis Nolan
12 Aug 2020
Speech Communication | VOL. 124

Phonological content impact on wrongful convictions in Forensic Voice Comparison context
Moez Ajili ... Juliette Kahn
-
Moez Ajili, et. al.Moez Ajili ... Juliette Kahn
01 Mar 2017
01 Mar 2017

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Do long-term acoustic-phonetic features and mel-frequency cepstral coefficients provide complementary speaker-specific information for forensic voice comparison?

Abstract

Talk to us

Similar Papers

More From: Forensic Science International