Abstract

This paper proposes to extend the bandwidth of speech acquired by a system combining laser Doppler vibrometer (LDV) and an auxiliary microphone in adverse environments, where this system is referred to as LDV-AM. Traditional bandwidth extension (BWE) algorithms often need two stages to estimate the broadband spectral envelope. In the first stage, the codebook of the broadband spectral envelope is trained by a training data set. In the second stage, the broadband spectral envelope is estimated with the codebook by using different algorithms, such as neural networks and linear mapping. After considering that speech acquired by LDV only contains speech components lower than 2kHz, it seems improper to use the traditional BWE algorithms directly. Because these traditional BWE algorithms are often designed to extend narrawband telephone speech, which is sampled at 8kHz. With the help of the auxiliary microphone, the broadband spectral envelope of speech acquired by LDV can be estimated directly from speech acquired by this microphone. Experimental results show that the proposed BWE algorithm for LDV-AM can greatly improve MOS scores in adverse environments.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.