Abstract

Speech rhythm is one of the main prosodic features of spoken languages. The quantitative research data on this topic in the literature are not numerous may be due to the confusing divergence in this concept definition. In order to render clear this concept, we investigated a speech segment in Mandarin. Due to its delicate temporal structure, it leads easily to the semantic ambiguities, well known in speech recognition. In order to understand how human manage the speech meaning in a natural language processing, we designed a series of lexical duration and pause interval modulations based on the speech segment for production. By analyzing the resultant acoustic patterns, we observed two types of temporal grouping. The type 1 is context dependent, the time range has no regularity, the modulations suffered from semantic ambiguities. The type 2 includes the stable patterns presenting temporal regularities. We consider the temporal structures in this type as speech rhythm. This definition corresponds to the articulatory periodicity that produces the consonant and vowel alternations. They are context independent and reproducible.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.