Abstract

Abstract Correcting lisps in speech can prove to be of great difficulty to many, as they may be unaware of whether they are lisping. To help those affected, we have developed a simple algorithm for the real-time identification of the sigmatismus lateralis in ”S” sounds within speech via analysis in the frequency domain. The algorithm identifies peaks within the lisp’s frequency band after calibration. A frequency band of 3000-4000 Hz has been identified to be generally accurate for lisp and 2500-3000 Hz for the correct pronunciation for a single male test subject. The algorithm splits given speech recordings into smaller segments and compares the number of lisps and non-lisps detected in these segments to categorize. From tests, it was concluded that a segment length of 0.5 s produces the best results. The algorithm does not detect every lisp section, however it does not raise false positives. Our implementation in Julia with multi-threaded per-file analysis is able to analyze 20 files of lengths between 5 s and 10 s within 0.21 s on a Qualcomm Snapdragon 860 smartphone chipset, meaning analysis is far faster than real-time. The proposed algorithm is a simple prototype algorithm capable of realtime analysis of audio in the frequency domain to identify whether lateral lisps are the dominant sibilant pronunciation in a given window. The method was only tested for a single test subject. However, a calibration algorithm capable of adjusting parameters to new individuals is proposed. The algorithm itself should be easily expandable to identify other speech impediments.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.