Abstract

This paper presents our study on the feasibility and effectiveness of using the MELP (mixed excitation linear prediction) model for coding wideband (7 kHz) speech signals at a transmission bit rate of 8 kbps. In order to achieve a reasonably good subjective quality for the decoded speech while maintaining a low operating bit rate at the same time, modifications to the pitch estimation, LP analysis/synthesis and post filtering stages of the original MELP model are discussed. Informal listening tests show that the subjective quality of the decoded speech of the proposed coder is rated to be slightly better than the MPEG-4 CELP coder operating at 14.4 kbps for both male and female utterances. The subjective quality of the decoded female utterances from the proposed coder operating at 8.4 kbps is rated to be comparable to that produced by the ITU G.722 coder operating at 48 kbps.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.