In order for non-vocal individuals to be able to communicate in a world that is oriented toward oral communication, several speech synthesis devices, such as the TI Speech Board and DecTalk, have been developed. People like Steven Hawking, a well-known physicist afflicted with Amyotrophic Lateral Sclerosis (ALS), use a speech producing device in order to overcome their vocal limitations. However, many of these synthesizers do not produce high-quality, natural sounding speech; nor do they reflect important considerations such as language-specific dialectical differences or speech variations depending on the gender and age of the intended user. Many new technologies are providing more convenient ways for non-vocal persons to interact within a predominately vocal society, and among these new technologies are multi-lingual speech synthesis devices. As the Spanish-speaking population in the United States continues to increase, so does the need for such speech synthesizers. The purpose of this article is to provide an explanation of a Spanish text-to-speech algorithm, which currently runs on an IBM compatible personal computer and is used in conjunction with speech synthesis devices to produce speech output. It represents the combined efforts of a team of colleagues at the Speech Synthesis Laboratory of the Applied Science and Enginerring Laboratories (ASEL) at the A.I. duPont Institute. 89). These methods do not address the fact that there are phonemes in the Spanish language that do not appear in English and vice versa. For example, the trilled vibrant /rrl in Spanish is not a phoneme in the English language, and the aspirated/h/l in English does not appear in Spanish. In addition, these methods are unsatisfactory because they do not capture natural speech qualities of the foreign language. The Spanish speech produced would sound veryAmericanized and would therefore be very difficult to comprehend. An alternative method involves using letter-to-sound rules developed specifically for the Spanish language in conjunction with a library of Spanish sounds to create high quality, highly intelligible Spanish speech. The first step in producing a Spanish text-tospeech system is to create an inventory of Spanish diphones, which are phonemic translations of letters. A diphone library contains at most n2 n entries, where n is the number of phonemes. The Spanish language consists of approximately 28 phonemes which implies a diphone library of 756 units. Let a 2-permutation represent a diphone. A 2-permutation is a coupled, ordered arrangement of all the phonemes (not including repetitions) of a set of Spanish phonemes, S, such as ab, ba, ac, ca, etc. Calculate the number of 2permutations of S using the following formula (Johnsonbaugh 51):
Read full abstract