Abstract
The speech therapy is essential during the whole treatment of Cleft Palate (CP) over many years. The automatic evaluation of CP speech could provide an effective assistant diagnosis. The glottal stop is a typical compensatory articulation error in CP speech. It is produced by adducting the vocal folds as a substitution during the pronunciation of oral pressure consonants. The existence of glottal stop has a great impact on speech intelligibility. In this work, an automatic glottal stop detection system is proposed. The CP speech database is collected by the Hospital of Stomatology, Sichuan University, which has the largest number of CP patients in China. This database includes extensive CP speech samples annotated by speech-language pathologists at the phoneme level. The vocabulary of this database includes all the initial consonants in Mandarin. The automatic initials and finals segmentation method is proposed firstly. Then, for the pressure initial consonants, four types of acoustic features are extracted: Mel-frequency cepstral coefficients, formants, Gammatone filtering energy feature, wavelet packet energy and Shannon entropy features. The features are processed using a k-nearest neighbor classifier. The k-fold cross validation is used to calculate the average detection accuracy, which reaches up to 93.5% using Gammatone filtering energy feature.
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have