Abstract

We present the use of stethoscope and silicon NAM (nonaudible murmur) microphones in automatic speech recognition. NAM microphones are special acoustic sensors, which are attached behind the talker's ear and can capture not only normal (audible) speech, but also very quietly uttered speech (nonaudible murmur). As a result, NAM microphones can be applied in automatic speech recognition systems when privacy is desired in human-machine communication. Moreover, NAM microphones show robustness against noise and they might be used in special systems (speech recognition, speech transform, etc.) for sound-impaired people. Using adaptation techniques and a small amount of training data, we achieved for a 20 k dictation task a word accuracy for nonaudible murmur recognition in a clean environment. In this paper, we also investigate nonaudible murmur recognition in noisy environments and the effect of the Lombard reflex on nonaudible murmur recognition. We also propose three methods to integrate audible speech and nonaudible murmur recognition using a stethoscope NAM microphone with very promising results.

Highlights

  • The NAM microphone [1] belongs to the acoustic sensor paradigm, in which speech is conducted not through the air, but within body tissues, bone, or the ear canal

  • We present the use of stethoscope and silicon NAM microphones in automatic speech recognition

  • We presented nonaudible murmur recognition in clean and noisy environments using NAM microphones

Read more

Summary

INTRODUCTION

The NAM microphone [1] belongs to the acoustic sensor paradigm, in which speech is conducted not through the air, but within body tissues, bone, or the ear canal. NAM microphones are special acoustic sensors, which can capture normal (audible) speech, and very quietly uttered speech (nonaudible murmur). Since a NAM microphone receives the speech signal directly from the body, it shows robustness against the environmental noises. It might be used in special systems (speech recognition, speech transform, etc.) for sound-impaired people. In [7] speaker-dependent nonaudible murmur recognition in a clean environment and using a stethoscope NAM microphone was reported. ×103 6 4 2 hms 0.4 0.8 1.2 1.6 2 2.4 2.8 3.2 hms Figure 2: Spectrogram of an audible Japanese utterance captured by a NAM microphone

Figure 1
NONAUDIBLE MURMUR CHARACTERISTICS
NONAUDIBLE MURMUR AUTOMATIC RECOGNITION
Experiments using clean and simulated noisy test data
Experiments using real noisy test data
THE ROLE OF THE LOMBARD REFLEX IN NONAUDIBLE MURMUR RECOGNITION
Lombard nonaudible murmur recognition using matched and crossed HMMs
AUDIBLE SPEECH RECOGNITION USING A STETHOSCOPE MICROPHONE
Using a combined HMM set
Findings
CONCLUSIONS

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.