Abstract

We utilize the inter-frame redundancy with the larger-size super-frame structure to realize ultra low bit rate speech encoding. A new clustering model of speech characteristics is proposed to process effectively the parameters of large super-frames. Based on the model, we present algorithms for ultra low bit rate speech encoding at 600 bps and 300 bps for applications in acoustically harsh environments. At the decoder, a close-loop excitation signal magnitude estimation model is employed to improve the naturalness of synthesized speech. Two prototypes have been realized and evaluated using the DRT tests based on the national standard of China. Both prototypes are able to synthesize high quality of speech with DRT score 88.85 and 81.78 respectively.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.