Time-frequency voiced and unvoiced models are proposed for the excitation of a harmonic autoregressive wideband speech analysis-synthesis system. The time-frequency voiced excitation (TFVEX) model has low time resolution defined by the concentration of the excitation signal distribution in the modulation domain while the time-frequency unvoiced excitation (TFUNEX) model has cycle time discrimination with lower amplitude resolution and while the frequency resolution for both models is an octave. The speech reconstructed by the compound TFUVEX unvoiced-voiced model is rated above the speech degraded by a modulated noise reference unit (MNRU) at 25 dB in listening tests while yielding a parametric compression of over ten times.
Read full abstract