Abstract

The authors propose a new speech communication system, called the image input microphone, that converts an oral image into voice. The system synthesizes the voice from the oral image alone, so it provides high security and is not affected by acoustic noise. Because the voice is synthesized without a recognition step, the system is independent of language. Simulations converting oral images to voice were carried out for the five Japanese vowels. A vocal tract area function is estimated from the oral image, and a PARCOR synthesis filter is obtained from that area function. The PARCOR synthesis filter is driven by a pulse train. The performance of the system was evaluated by hearing tests of the synthesized voice. An audible voice was synthesized, and the mean recognition rate for the five Japanese vowels was 91%.
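The last two stages of the pipeline described above (area function to PARCOR filter, then pulse-train excitation) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how the area function is estimated from the image, so the area values below are made-up placeholders, and the function names, the 8 kHz sampling rate, the 100 Hz pitch, and the sign convention for the reflection coefficients are all assumptions. The sketch relies on the standard lossless-tube result that the reflection coefficients at the junctions between adjacent tube sections play the role of PARCOR coefficients in an all-pole lattice filter.

```python
def area_to_reflection(areas):
    """Reflection coefficient at each junction between adjacent tube sections.

    Assumed convention: k = (A_next - A) / (A_next + A). Other texts flip the
    sign or the section ordering. For positive areas, |k| < 1 always holds,
    which keeps the lattice synthesis filter stable.
    """
    return [(a_next - a) / (a_next + a) for a, a_next in zip(areas, areas[1:])]


def pulse_train(n_samples, period):
    """Unit pulses every `period` samples, a crude model of voiced excitation."""
    return [1.0 if n % period == 0 else 0.0 for n in range(n_samples)]


def lattice_synthesize(excitation, k):
    """Run an all-pole lattice filter with reflection coefficients k[0..M-1]."""
    order = len(k)
    b = [0.0] * (order + 1)  # b[m] holds the backward signal of stage m, delayed by one sample
    out = []
    for x in excitation:
        f = x  # forward signal entering the highest stage
        new_b = [0.0] * (order + 1)
        for m in range(order, 0, -1):
            f = f - k[m - 1] * b[m - 1]             # forward signal leaving stage m
            new_b[m] = k[m - 1] * f + b[m - 1]      # backward signal of stage m
        new_b[0] = f  # at stage 0 the backward signal equals the output
        b = new_b
        out.append(f)
    return out


# Hypothetical area function in cm^2 (placeholder values, not from the paper).
example_areas = [2.6, 1.8, 1.0, 0.7, 1.4, 3.2, 5.0, 4.0]
example_k = area_to_reflection(example_areas)

# Assumed 8 kHz sampling and 100 Hz pitch give a pulse every 80 samples.
example_voice = lattice_synthesize(pulse_train(800, 80), example_k)
```

Under these assumptions, `example_voice` holds 0.1 s of a synthetic vowel-like waveform. Changing `example_areas` changes the resonances of the filter, which is how different mouth shapes would map to different vowels in this kind of model.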
