Abstract
We develop the real-time speech visualization system called “KanNon”[1, 2] which supports speech communication of deaf people. The KanNon system presents several information of the speech such as loudness, pitch, sound spectrogram and characters by speech recognition system in real-time. In the present system, we are adapting a word unit speech recognition system using large-scale dictionary. However the KanNon system is required quick and simple display of speech contents for smooth communication. For this purpose, we apply phonemic speech recognition system for Japanese 5 vowels using “Time-Delay Neural Network (TDNN)”. Further, we developed speech detection, voiced/unvoiced (v/uv) detection and change detection algorithms in the KanNon system. Finally, we show experimental results using real speech data.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
More From: Proceedings of the ISCIE International Symposium on Stochastic Systems Theory and its Applications
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.