Abstract
Problem statement: In the modern speech communication technology, the speech coding with bitrate scalability was needed. However, various types of noises in the communication channels cause damages in the transmitted information especially speech data. Tonal-language speech was also affected by this situation. Approach: Based on the high pitch delay resolution technique, the MPCELP speech coding was proposed over the environment with CDMA AWGN and Rayleigh fading channels for tonal language. The proposed coder supports multiple bitrates and also has the functionality of bitrate scalability. Results: Through performance analysis and computer simulation, the quality of the proposed coding was presented with an improvement from conventional scalable MP-CELP in the specific-noise environments. The HPDR technique was applied to the MP-CELP to use for tonal language, meanwhile it can support the core coding rate of 5.6, 8.2, 12.2 kbps and additional scaled bitrates. Conclusion: By applying the high pitch delay resolution technique with the MP-CELP speech coding, we can improve the quality of tonal encoded speech. Moreover, the coding quality of the proposed coder was better than the conventional coder for Thai language over both AWGN channel and Rayleigh fading channel.
Highlights
Nowadays the digital communications are widely developed
Since pitch delay significantly involves in tone of tonal language, this study proposes an improvement of the bitrate scalable MP-CELP coder by applying the High Pitch Delay Resolutions (HPDR) technique to the pitch analysis of the core coder
This indicates that the proposed HPDR technique brings about better pitch precision which causes the improvement of the coding quality for tonal language over both AWGN channel and Rayleigh fading channel
Summary
Nowadays the digital communications are widely developed. The audio, images, video or data information can be transmitted pass through wire or wireless network channels. To improve speech quality for background noise conditions, the adaptive pulse location restriction method are applied (Ozawa and Serizawa, 1998) This coder operates at various bitrates ranging from 4-12 kbps utilizing the flexibility in multi-pulse excitation coding (Nomura et al, 1998). This study proposes a bitrate scalable tonal language speech coder based on a multi-pulse based code excited linear predictive coding (Taumi et al, 1996; Ozawa et al, 1997). The proposed coder provides the bitrate scalabilities which is effective in multimedia communications This coder is improved for the tonal language speech by applying the high pitch delay resolutions to retain the tone information precision
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.