Intelligibility Assessment of the De-Identified Speech Obtained Using Phoneme Recognition and Speech Synthesis Systems

Tadej Justin,France Mihelič,Simon Dobrišek

doi:10.1007/978-3-319-10816-2_64

Abstract

The paper presents and evaluates a speaker de-identification technique using speech recognition and two speech synthesis techniques. The phoneme recognition system is built using HMM-based acoustical models of context-dependent diphone speech units, and two different speech synthesis systems (diphone TD-PSOLA-based and HMM-based) are employed for re-synthesizing the recognized sequences of speech units. Since the acoustical models of the two speech synthesis systems are assumed to be completely independent of the input speaker’s voice, the highest level of input speaker de-identification is ensured. The proposed de-identification system is considered to be language dependent, but is, however, vocabulary and speaker independent since it is based mainly on acoustical modelling of the selected diphone speech units. Due to the relatively simple computing methods, the whole de-identification procedure runs in real-time.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Intelligibility Assessment of the De-Identified Speech Obtained Using Phoneme Recognition and Speech Synthesis Systems

Abstract

Talk to us

Similar Papers

Lead the way for us

Similar Papers

Speaker de-identification using diphone recognition and speech synthesis
Tadej Justin ... Bostjan Vesnicer
-
Tadej Justin, et. al.Tadej Justin ... Bostjan Vesnicer
01 May 2015
01 May 2015

Automatic phoneme recognition with Segmental Hidden Markov Models
Areg G Baghdasaryan ... A A Beex
-
Areg G Baghdasaryan, et. al.Areg G Baghdasaryan ... A A Beex
01 Nov 2011
01 Nov 2011

A fast phoneme recognition system based on sparse representation of test utterances
Armin Saeb ... Farbod Razzazi
-
Armin Saeb, et. al.Armin Saeb ... Farbod Razzazi
01 May 2014
01 May 2014

Phonetic alignment: speech synthesis-based vs. Viterbi-based
F Malfrère ... C Ris
Speech Communication | VOL. 40
F Malfrère, et. al.F Malfrère ... C Ris
13 Sep 2002
Speech Communication | VOL. 40

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Intelligibility Assessment of the De-Identified Speech Obtained Using Phoneme Recognition and Speech Synthesis Systems

Abstract

Talk to us

Similar Papers