Abstract

The extrema of the logarithmic derivative of the mean energy of a voice signal in the frequency range of 1000–3000 Hz are used to determine the instants of opening and closure of the glottis. The inaccuracy of analysis is estimated with the Arctic CMU database, which contains synchronous recordings of speech signals and electro-glottograms. The estimates of the instants of opening and closure of the glottis, found by the developed algorithm, are compared with the instants of the maximum and minimum of the derivative from electro-glottogram signals, which are taken as the “true” instants. The mean square deviation of the glottal opening instant from the extrema of the derivative from the electro-glottogram signals for different speakers is in the range of 1.03–1.64 ms. The error rate of a false estimate of the glottal opening instant is from 0.01 to 0.14%, and the error rate of omission is from 0.42 to 2.38%. An error-detection algorithm is developed. The mean square deviation with an relative—to the period of the fundamental tone—error in detecting the glottal opening instant is in the range of 13–18% for the most probable error from 0 to +5%.

Full Text
Paper version not known

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.