Abstract

We develop the real-time speech visualization system called “KanNon”[1, 2] which supports speech communication of deaf people. The KanNon system presents several information of the speech such as loudness, pitch, sound spectrogram and characters by speech recognition system in real-time. In the present system, we are adapting a word unit speech recognition system using large-scale dictionary. However the KanNon system is required quick and simple display of speech contents for smooth communication. For this purpose, we apply phonemic speech recognition system for Japanese 5 vowels using “Time-Delay Neural Network (TDNN)”. Further, we developed speech detection, voiced/unvoiced (v/uv) detection and change detection algorithms in the KanNon system. Finally, we show experimental results using real speech data.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.