Singing Voice Analysis Research Articles

Singing voice is a human quality that requires the precise coordination of numerous kinetic functions and results in a perceptually variable auditory outcome. The use of multi-sensor systems can facilitate the study of correlations between the vocal mechanism kinetic functions and the voice output. This is directly relevant to vocal education, rehabilitation, and prevention of vocal health issues in educators; professionals; and students of singing, music, and acting. In this work, we present the initial design of a modular multi-sensor system for singing voice analysis, and describe its first assessment experiment on the ‘vocal breathiness’ qualitative characteristic. A system case study with two professional singers was conducted, utilizing signals from four sensors. Participants sung a protocol of vocal trials in various degrees of intended vocal breathiness. Their (i) vocal output, (ii) phonatory function, and (iii) respiratory behavior-per-condition were recorded through a condenser microphone (CM), an Electroglottograph (EGG), and thoracic and abdominal respiratory effort transducers (RET), respectively. Participants’ individual respiratory management strategies were studied through qualitative analysis of RET data. Microphone audio samples breathiness degree was rated perceptually, and correlation analysis was performed between sample ratings and parameters extracted from CM and EGG data. Smoothed Cepstral Peak Prominence (CPPS) and vocal folds’ Open Quotient (OQ), as computed with the Howard method (HOQ), demonstrated the higher correlation coefficients, when analyzed individually. DECOM method-computed OQ (DOQ) was also examined. Interestingly, the correlation coefficient of pitch difference between estimates from CM and EGG signals appeared to be (based on the Pearson correlation coefficient) statistically insignificant (a result that warrants investigation in larger populations). The study of multi-variate models revealed even higher correlation coefficients. Models studied were the Acoustic Breathiness Index (ABI) and the proposed multiple regression model CDH (CPPS, DOQ, and HOQ), which was attempted in order to combine analysis results from microphone and EGG signals. The model combination of ABI and the proposed CDH appeared to yield the highest correlation with perceptual breathiness ratings. Study results suggest potential for the use of a completed system version in vocal pedagogy and research, as the case study indicated system practicality, a number of pertinent correlations, and introduced topics with further research possibilities.

Read full abstract

Soprano singers face a number of specific challenges when singing vowels at high frequencies, due to the wide spacing of harmonics in the voice source. The varied and complex techniques used to overcome these are still not fully understood. Magnetic resonance imaging (MRI) has become increasingly popular in recent years for singing voice analysis. This study proposes a new protocol using three-dimensional MRI to investigate the articulatory parameters relevant to resonance tuning, a technique whereby singers alter their vocal tract to shift its resonances nearer to a voice source harmonic, increasing the amplitude of the sound produced. The protocol was tested with a single soprano opera singer. Drawing on previous MRI studies, articulatory measurements from three-dimensional MRI images were compared to vocal tract resonances measured directly using broadband noise excitation. The suitability of the protocol was assessed using statistical analysis. No clear linear relationships were apparent between articulatory characteristics and vocal tract resonances. The results were highly vowel dependent, showing different patterns of resonance tuning and interactions between variables. This potentially indicates a complex interaction between the vocal tract and sung vowels in soprano voices, meriting further investigation. The effective interpretation of MRI data is essential for a deeper understanding of soprano voice production and, in particular, the phenomenon of resonance tuning. This paper presents a new protocol that contributes toward this aim, and the results suggest that a more vowel-specific approach is necessary in the wider investigation of resonance tuning in female voices.

Read full abstract

Singing Voice Analysis Research Articles

Articles published on Singing Voice Analysis

Determination of the vocal tract model order in iterative adaptive inverse filtering

Towards a Singing Voice Multi-Sensor Analysis Tool: System Design, and Assessment Based on Vocal Breathiness.

BioVoice: A multipurpose tool for voice analysis

Robust singer identification of Indian playback singers

KaraMIR: A Project for Cover Song Identification and Singing Voice Analysis Using a Karaoke Songs Dataset

Determining the Relevant Criteria for Three-dimensional Vocal Tract Characterization

Singing Voice Separation and Vocal F0 Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation

Practical determination of acoustic parameters of the singing voice implemented in the interactive analysis software EVOCANTO.

A system for parallel measurement of glottis opening and larynx position

Inverse filtering in singing voice: a critical analysis

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Singing Voice Analysis Research Articles

Articles published on Singing Voice Analysis

Determination of the vocal tract model order in iterative adaptive inverse filtering

Towards a Singing Voice Multi-Sensor Analysis Tool: System Design, and Assessment Based on Vocal Breathiness.

BioVoice: A multipurpose tool for voice analysis

Robust singer identification of Indian playback singers

KaraMIR: A Project for Cover Song Identification and Singing Voice Analysis Using a Karaoke Songs Dataset

Determining the Relevant Criteria for Three-dimensional Vocal Tract Characterization

Singing Voice Separation and Vocal F0 Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation

Practical determination of acoustic parameters of the singing voice implemented in the interactive analysis software EVOCANTO.

A system for parallel measurement of glottis opening and larynx position

Inverse filtering in singing voice: a critical analysis