Abstract
We present the use of stethoscope and silicon NAM (nonaudible murmur) microphones in automatic speech recognition. NAM microphones are special acoustic sensors, which are attached behind the talker's ear and can capture not only normal (audible) speech, but also very quietly uttered speech (nonaudible murmur). As a result, NAM microphones can be applied in automatic speech recognition systems when privacy is desired in human-machine communication. Moreover, NAM microphones show robustness against noise and they might be used in special systems (speech recognition, speech transform, etc.) for sound-impaired people. Using adaptation techniques and a small amount of training data, we achieved for a 20 k dictation task a word accuracy for nonaudible murmur recognition in a clean environment. In this paper, we also investigate nonaudible murmur recognition in noisy environments and the effect of the Lombard reflex on nonaudible murmur recognition. We also propose three methods to integrate audible speech and nonaudible murmur recognition using a stethoscope NAM microphone with very promising results.
Highlights
The NAM microphone [1] belongs to the acoustic sensor paradigm, in which speech is conducted not through the air, but within body tissues, bone, or the ear canal
We present the use of stethoscope and silicon NAM microphones in automatic speech recognition
We presented nonaudible murmur recognition in clean and noisy environments using NAM microphones
Summary
The NAM microphone [1] belongs to the acoustic sensor paradigm, in which speech is conducted not through the air, but within body tissues, bone, or the ear canal. NAM microphones are special acoustic sensors, which can capture normal (audible) speech, and very quietly uttered speech (nonaudible murmur). Since a NAM microphone receives the speech signal directly from the body, it shows robustness against the environmental noises. It might be used in special systems (speech recognition, speech transform, etc.) for sound-impaired people. In [7] speaker-dependent nonaudible murmur recognition in a clean environment and using a stethoscope NAM microphone was reported. ×103 6 4 2 hms 0.4 0.8 1.2 1.6 2 2.4 2.8 3.2 hms Figure 2: Spectrogram of an audible Japanese utterance captured by a NAM microphone
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.