Abstract
This research studies the perceptual evaluation of speech signals using an inexpensive recording device. Different types of noise-reduction and electronic enhancement filters viz. Hamming window, high-pass filter (HPF), Wiener-filter and no-speech activity-cancelling were applied in compliance with the testing conditions such as P.835. In total, 41 volunteers participated in the study for identifying the effects of those filters following a repeatable approach. Performance was assessed in terms of advanced perceptual audio features. This study is believed to be beneficial for both users and device manufacturers as the suggested technique is relatively simple to embed in operational device algorithms or in the master GPU.
Highlights
Though the Wiener technique is widely known for the effective processing of noisy images [32], it is employed in recent research for audio signal processing for both single-channel speech enhancement [33] and Perceptual Evaluation of Speech Quality (PESQ)-based speech enhancement [24]
Results from subjective tests carried out in compliance with the ITU-T Recommendation P.835 are given in Table 3, where each run summarizes an average of four audio clips
Playing “non-silence zones” for longer durations made the listeners uncomfortable due to the sudden silent zones; this method is not recommended to be used for speech that is longer than 30 s in duration
Summary
P.835 to assess the quality of speech recorded via an inexpensive microphone. The speech signals were processed using several audio enhancement techniques and filters. As the experiments involved human interactions, all necessary approvals were obtained from the. Dean of the College of Engineering at the University of Jeddah in Saudi Arabia
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have