Abstract

In a previous paper [S. E. Estes, H. R. Kerby, H. D. Maxey, and R. M. Walker, “Speech Synthesis from Stored Data,” J. Acoust. Soc. Am. 34, 2003(A) (1962)], a method for unlimited vocabulary synthesis was presented and discussed. The method consists of computer assembly of digital data representing diphone-length segments, together with auxiliary data specifying intonation and timing, to produce synthetic speech. In order to evaluate the adequacy of this method of segmentation, intelligilibity tests have been conducted using PB words in a standard carrier phrase. The results of these studies have indicated that intelligibility scores on synthetically produced PB words are affected by at least two major factors: familiarity with the words under test and familiarity with the diphone components of the words. A study has been designed to test the effects of these two factors on the intelligibility scores of normal adult listeners in sound-field conditions. The stimuli used are selected C.I.D. W-22 lists. These results are presented and discussed.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.