Abstract
In an attempt to enhance voice recognition capabilities, several transformations have been applied to the acoustic speech signal. The transformations include first- and second-order partial differentiation of the amplitude of the frequency-time-amplitude contour with respect to frequency and with respect to time, and variable segmentation of this contour and of the derivative contours based on criteria obtained from the partial-derivative information. To provide the initial database for the experiments, an instrumentation system was implemented to digitize the speech contours at a high resolution of 256 frequency channels by 125 time increments, or 32 000 data points. Computer-generated contour spectrograms were produced for the original contour and for the first and second partial-derivative contours. Speech recognition and speaker identification systems were simulated on a computer to evaluate the effectiveness of the transformations.
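As a rough illustration of the derivative transformations described above, the following Python sketch computes first- and second-order partial derivatives of an amplitude contour with respect to frequency and time. It is only a sketch under stated assumptions: the function name `contour_derivatives`, the unit grid spacing, the synthetic test contour, and the use of NumPy finite differences are all hypothetical choices, since the numerical method used in the original system is not given here. The segmentation criteria are likewise not specified, so they are not modeled.

```python
import numpy as np

N_FREQ = 256   # frequency channels, as stated in the abstract
N_TIME = 125   # time increments, as stated in the abstract


def contour_derivatives(contour, d_freq=1.0, d_time=1.0):
    """Finite-difference partial derivatives of an amplitude contour.

    contour: 2-D array of amplitudes, shape (n_freq, n_time).
    d_freq, d_time: grid spacing along each axis (assumed uniform).
    """
    contour = np.asarray(contour, dtype=float)
    # First-order partials: axis 0 is frequency, axis 1 is time.
    dA_df = np.gradient(contour, d_freq, axis=0)
    dA_dt = np.gradient(contour, d_time, axis=1)
    # Second-order partials, obtained by differentiating again.
    d2A_df2 = np.gradient(dA_df, d_freq, axis=0)
    d2A_dt2 = np.gradient(dA_dt, d_time, axis=1)
    return {
        "dA_df": dA_df,
        "dA_dt": dA_dt,
        "d2A_df2": d2A_df2,
        "d2A_dt2": d2A_dt2,
    }


# Synthetic contour (illustrative only): amplitude linear in both indices.
freq_idx, time_idx = np.meshgrid(
    np.arange(N_FREQ), np.arange(N_TIME), indexing="ij"
)
synthetic = 2.0 * freq_idx + 3.0 * time_idx
derivs = contour_derivatives(synthetic)
```

Each returned array has the same shape as the input, so the derivative contours can be plotted or segmented on the same 256 by 125 grid as the original contour.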