Speech Synthesis Method Research Articles

A speech synthesizer uses a digital waveguide network to simulate operation of the human pharynx on acoustic signals. One end of the digital waveguide network is connected to a glottal signal source, and another end has a signal filter simulating operation of the acoustic interface at a person's lips. The digital waveguide network has sets of waveguide sections connected in series by junctions, each waveguide section including two digital delay lines running parallel to each other for propagating signals in opposite directions. Each waveguide junction has associated reflection and propagation coefficients. A parameter library that stores sets of glottal source and waveguide junction control parameters for generating corresponding sets of predefined speech signals. The waveguide junction control parameters cause the digital waveguide network to simulate operation of an acoustic tube with a shape corresponding to that of a human pharynx while producing predefined speech sounds. An articulation controller operates the glottal signal source and the digital waveguide network using a sequence of selected sets of said control parameters, thereby causing the synthesizer to generate a specified sequence of speech signals. In a preferred embodiment, the digital waveguide network has three interconnected network branches for simulating operation of the lower pharynx, the oropharynx and the nasopharynx. To generate speech signals corresponding to fricative consonants, the speech synthesizer has noise signal injectors positioned at various points along the digital waveguide network.

Read full abstract

There are two approaches for constructing an appropriate fundamental frequency (F0) control method for speech synthesis: statistical and rule-based. The statistical approach has the advantage of automatic training, but it requires a large corpora of speech that is annotated with prosodic boundaries. Recently, a method is proposed for high-accuracy detection of these boundaries [Ostendorf and Ross (1996)], given a set of prosodic boundary candidates in which almost all the correct boundaries are included. This paper proposes a detection method to generate these boundary candidates, specifically for accentual phrases which represent one of the smallest prosodic units. The detection algorithm uses local maximums and minimums of the F0 contour and low-energy regions of the speech waveform for finding candidate regions that correspond to accentual phrases and pauses in speech. The candidate phrase boundaries are then aligned to the nearest phoneme boundaries, which are detected automatically using forced alignment with a speaker-independent speech recognition system given a phoneme transcription. This method was applied to 250 read Japanese sentences. High-detection accuracy (97%) was obtained, with almost all the missed detections having valid candidates within ±3 phonemes. The insertion error rate was less than double the number of correct boundaries.

Read full abstract

Speech Synthesis Method Research Articles

Related Topics

Articles published on Speech Synthesis Method

Speech synthesis method utilizing auxiliary information, medium recorded thereon the method and apparatus utilizing the method

Method of speech synthesis by means of concentration and partial overlapping of waveforms

遺伝的アルゴリズムに基づく音声合成のためのスペクトルパタン圧縮法

Speech synthesizer having an acoustic element database

Speech synthesis method based on application-specific synthesis units and its implementation on a 32-bit microprocessor

Speech synthesis apparatus and method for synthesizing speech from a character series comprising a text and pitch information

The role of speech synthesis in Requiem per una veu perduda

Method of speech representation and synthesis using a set of high level constrained parameters

Text-to-speech synthesis by concatenation using or modifying clustered phoneme waveforms on basis of cluster parameter centroids

Speech synthesis system and method utilizing phoneme information and rhythm information

Automatic creation of CV templates for formant type speech synthesis based on HMM-based segmentation and syllable boundary detection

Digital waveguide speech synthesis system and method

Automatic detection of accentual phrase boundaries using prosodic features and phoneme boundaries

Speech synthesis by rule based on VCV waveform synthesis units

Method and apparatus for speech analysis and synthesis by sampling a power spectrum of input speech

Method and apparatus for speech synthesis based on prosodic analysis

System and method for speech synthesis employing improved formant composition

Analysis of quality factors in synthetic speech produced by rules

FRACTAL MODELING OF SPEECH SIGNALS

Speech synthesis apparatus and method

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Speech Synthesis Method Research Articles

Related Topics

Articles published on Speech Synthesis Method

Speech synthesis method utilizing auxiliary information, medium recorded thereon the method and apparatus utilizing the method

Method of speech synthesis by means of concentration and partial overlapping of waveforms

遺伝的アルゴリズムに基づく音声合成のためのスペクトルパタン圧縮法

Speech synthesizer having an acoustic element database

Speech synthesis method based on application-specific synthesis units and its implementation on a 32-bit microprocessor

Speech synthesis apparatus and method for synthesizing speech from a character series comprising a text and pitch information

The role of speech synthesis in Requiem per una veu perduda

Method of speech representation and synthesis using a set of high level constrained parameters

Text-to-speech synthesis by concatenation using or modifying clustered phoneme waveforms on basis of cluster parameter centroids

Speech synthesis system and method utilizing phoneme information and rhythm information

Automatic creation of CV templates for formant type speech synthesis based on HMM-based segmentation and syllable boundary detection

Digital waveguide speech synthesis system and method

Automatic detection of accentual phrase boundaries using prosodic features and phoneme boundaries

Speech synthesis by rule based on VCV waveform synthesis units

Method and apparatus for speech analysis and synthesis by sampling a power spectrum of input speech

Method and apparatus for speech synthesis based on prosodic analysis

System and method for speech synthesis employing improved formant composition

Analysis of quality factors in synthetic speech produced by rules

FRACTAL MODELING OF SPEECH SIGNALS

Speech synthesis apparatus and method