Speech synthesis system

Richard Anthony Sharman

doi:10.1121/1.423023


Journal: The Journal of the Acoustical Society of America

Publication Date: Jun 1, 1998

Keywords: Hidden Markov Model, Sequence of Phonemes, Speech Synthesis System, Speech Synthesis, State Transition, Speech Synthesis Unit, Output Distributions, Text Processor, Input Sequence, Audio Signal

Abstract

A speech synthesis unit comprises a text processor which breaks down text into phonemes, a prosodic processor which assigns properties such as length and pitch to the phonemes based on context, and a synthesis unit which outputs an audio signal representing the sequence of phonemes according to the specified properties. The prosodic processor includes a Hidden Markov Model (HMM) to predict the durations of the phonemes. Each state of the HMM represents a duration, and the outputs are phonemes. The HMM is trained on a set of data consisting of phonemes of known identity and duration, to allow the state transition and output distributions to be calculated. The HMM can then be used for any given input sequence of phonemes to predict a most likely sequence of corresponding durations.
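The duration model described in the abstract can be illustrated with a minimal sketch. It assumes that durations are quantized into a small set of bins (here in milliseconds), that the transition and output distributions are estimated by counting with add-one smoothing, and that the most likely duration sequence is found with the Viterbi algorithm. The abstract does not specify any of these details. The names `train`, `predict_durations`, `TRAINING`, and `UNSEEN_LOG_PROB`, and the toy data, are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter, defaultdict

# Assumption: log-probability floor for phonemes never seen in training.
UNSEEN_LOG_PROB = math.log(1e-6)


def train(sequences, smoothing=1.0):
    """Estimate HMM parameters from sequences of (phoneme, duration_bin) pairs.

    Hidden states are duration bins; observed outputs are phonemes.
    """
    states = sorted({d for seq in sequences for _, d in seq})
    phonemes = sorted({p for seq in sequences for p, _ in seq})
    start, trans, emit = Counter(), defaultdict(Counter), defaultdict(Counter)
    for seq in sequences:
        if not seq:
            continue
        start[seq[0][1]] += 1
        for phoneme, duration in seq:
            emit[duration][phoneme] += 1
        for (_, d1), (_, d2) in zip(seq, seq[1:]):
            trans[d1][d2] += 1

    def log_dist(counts, support):
        # Add-one (Laplace) smoothing, then convert to log-probabilities.
        total = sum(counts[k] for k in support) + smoothing * len(support)
        return {k: math.log((counts[k] + smoothing) / total) for k in support}

    return {
        "states": states,
        "start": log_dist(start, states),
        "trans": {s: log_dist(trans[s], states) for s in states},
        "emit": {s: log_dist(emit[s], phonemes) for s in states},
    }


def predict_durations(model, phonemes):
    """Viterbi decoding: most likely duration bin for each input phoneme."""
    if not phonemes:
        return []
    states = model["states"]

    def emit(state, phoneme):
        return model["emit"][state].get(phoneme, UNSEEN_LOG_PROB)

    score = {s: model["start"][s] + emit(s, phonemes[0]) for s in states}
    backpointers = []
    for phoneme in phonemes[1:]:
        prev, score, pointers = score, {}, {}
        for s in states:
            best = max(states, key=lambda r: prev[r] + model["trans"][r][s])
            score[s] = prev[best] + model["trans"][best][s] + emit(s, phoneme)
            pointers[s] = best
        backpointers.append(pointers)

    # Trace back from the best final state.
    path = [max(states, key=lambda s: score[s])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


# Toy training data (invented): phonemes paired with duration bins in ms.
TRAINING = [
    [("k", 50), ("ae", 150), ("t", 50)],
    [("t", 50), ("ae", 150), ("k", 50)],
    [("s", 100), ("ae", 150), ("t", 50)],
]

model = train(TRAINING)
print(predict_durations(model, ["k", "ae", "t"]))
```

In this sketch the roles of state and output are deliberately the reverse of the usual recognition setup: the known quantity (the phoneme) is treated as the observation, and the unknown quantity (the duration) is the hidden state, so decoding yields one duration per input phoneme. A real system would likely use finer duration bins and far more training data than this toy example.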
