Abstract
Singing voice is a human quality that requires the precise coordination of numerous kinetic functions and results in a perceptually variable auditory outcome. The use of multi-sensor systems can facilitate the study of correlations between the vocal mechanism kinetic functions and the voice output. This is directly relevant to vocal education, rehabilitation, and prevention of vocal health issues in educators; professionals; and students of singing, music, and acting. In this work, we present the initial design of a modular multi-sensor system for singing voice analysis, and describe its first assessment experiment on the ‘vocal breathiness’ qualitative characteristic. A system case study with two professional singers was conducted, utilizing signals from four sensors. Participants sung a protocol of vocal trials in various degrees of intended vocal breathiness. Their (i) vocal output, (ii) phonatory function, and (iii) respiratory behavior-per-condition were recorded through a condenser microphone (CM), an Electroglottograph (EGG), and thoracic and abdominal respiratory effort transducers (RET), respectively. Participants’ individual respiratory management strategies were studied through qualitative analysis of RET data. Microphone audio samples breathiness degree was rated perceptually, and correlation analysis was performed between sample ratings and parameters extracted from CM and EGG data. Smoothed Cepstral Peak Prominence (CPPS) and vocal folds’ Open Quotient (OQ), as computed with the Howard method (HOQ), demonstrated the higher correlation coefficients, when analyzed individually. DECOM method-computed OQ (DOQ) was also examined. Interestingly, the correlation coefficient of pitch difference between estimates from CM and EGG signals appeared to be (based on the Pearson correlation coefficient) statistically insignificant (a result that warrants investigation in larger populations). The study of multi-variate models revealed even higher correlation coefficients. Models studied were the Acoustic Breathiness Index (ABI) and the proposed multiple regression model CDH (CPPS, DOQ, and HOQ), which was attempted in order to combine analysis results from microphone and EGG signals. The model combination of ABI and the proposed CDH appeared to yield the highest correlation with perceptual breathiness ratings. Study results suggest potential for the use of a completed system version in vocal pedagogy and research, as the case study indicated system practicality, a number of pertinent correlations, and introduced topics with further research possibilities.
Highlights
Singing is the result of certain neuromuscular functions pertaining to the vocal mechanism of the human body
We observe a decrease in the average CPPS values as the breathiness rates are increasing
We can note that the Open Quotient (OQ) values increase as the breathiness rate increase (0.5285 for breathiness = 0 and 0.6473 for breathiness = 4, using the Howard algorithm) and that the use of Howard algorithm yielded higher values compared to the DECOM method
Summary
Singing is the result of certain neuromuscular functions pertaining to the vocal mechanism of the human body. Singing voice acoustic and perceptual properties have been studied since Ancient Greek times, mainly by Aristotle, Aristoxenus and Dionysius of Halicarnassus, and subsequently by Porphyrios, Nicomachus, and Cleonides [1]. This domain’s study has been continuing, and a plethora of modern-day research works have been actualized regarding its anatomical, physiological, and acoustical properties [2], there is still much to be uncovered [3]. One of the features that seems to require further investigation is singing voice overall quality assessment, which entails the quantification of individual perceptual voice attributes that define it, as “no agreed-upon method currently exists for objective measurement of perceived voice quality” [4] The quantification and understanding of such qualitative measures is important to vocal education, as well as to early diagnosis of voice misuse and prevention of relevant vocal health issues in professional singing voice users (singers, actors, music teachers, etc.)
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.