Abstract

Experiments were performed to investigate the roles of voice source and vocal tract characteristics in the perception of talker individuality, using a new formant-type speech analysis-synthesis-editing system based on the ARX (auto-regressive with exogenous input) speech production model. A key feature of the system is an algorithm that automatically estimates the voice source and vocal tract parameters of the synthesizer directly from a speech utterance. Results of experiments with five male talkers show that vocal tract characteristics contribute more to the perception of talker individuality than the voice source does, and that the static component of the formant trajectories is a primary cue to talker individuality.
