Normal Phonation Research Articles

Dynamic imaging of the vocal tract using real-time MRI has been an active and growing area of research, having demonstrated great potential to become routinely performed in the clinical evaluation of speech and swallowing disorders. Although many technical advances have been made in regards to acquisition and reconstruction methodologies, there is still no consensus in best practice protocols. This study aims to compare Cartesian and non-Cartesian real-time MRI sequences, regarding image quality and temporal resolution trade-off, for dynamic speech imaging. Five subjects were imaged at 1.5T, while performing normal phonation, in order to assess velar motion and velopharyngeal closure. Data was acquired using both Cartesian and non-Cartesian (spiral and radial) real-time sequences at five different spatial-temporal resolution sets, between 10 fps (1.7×1.7×10 mm3) and 25 fps (1.5×1.5×10 mm3). Only standard scanning resources provided by the MRI scanner manufacturer were used to ensure easy applicability to clinical evaluation and reproducibility. Data sets were evaluated by comparing measurements of the velar structure, dynamic contrast-to-noise ratio and image quality visual scoring. Results showed that for all proposed sequences, FLASH spiral acquisitions provided higher contrast-to-noise ratio, up to a 170.34% increase at 20 fps, than equivalent bSSFP Cartesian acquisitions for the same spatial-temporal resolution. At higher frame rates (22 and 25 fps), spiral protocols were optimal and provided higher CNR and visual scoring than equivalent radial protocols. Comparison of dynamic imaging at 10 and 22 fps for radial and spiral acquisitions revealed no significant difference in CNR performance, thus indicating that temporal resolution can be doubled without compromising spatial resolution (1.9×1.9 mm2) or CNR. In summary, this study suggests that the use of FLASH spiral protocols should be preferred over bSSFP Cartesian for the dynamic imaging of velopharyngeal closure, as it allows for an improvement in CNR and overall image quality without compromising spatial-temporal resolution.

Sir, It was with great interest that we read the article titled “Feasibility study to assess clinical applications of 3T cine MRI coupled with synchronous audio recording during speech in evaluation of velopharyngeal insufficiency in children” by Sagar and Nimkin [1]. We believe dynamic real-time MRI will become a prevalent tool in studying velopharyngeal closure and, when associated with detailed anatomical scans, should provide added clinical information compared to the current imaging modalities, videofluoroscopy and nasendoscopy. In this regard, the authors must be commended for their detailed comparison of real-time MRI with those techniques. However, it is worth highlighting that contrary to what is stated in the article, many studies of velopharyngeal closure using real-time MRI have been published in the imaging literature, most of them with synchronised audio recording. Studies focussing specifically on velopharyngeal closure include but are not limited to those by Beer et al. [2], which compared the technique to X-ray videofluoroscopy, Bae et al. [3] at 3 T, and Scott et al. [4], which included data at both 1.5 and 3 T. The last two studies were carried out with simultaneous audio recording. There are far too many speech studies using real-time MRI to mention in a short letter, although the work conducted at the University of Southern California both on acquisition and analysis [5, 6] is worth highlighting. We strongly encourage the interested reader to refer to the recent review of the field [7]. When studying velopharyngeal closure, the dynamic frame rate is a key issue, and we are concerned that only two frames per second (fps) were used in the study by Sagar and Nimkin [1]. Although there is no doubt this is sufficient to study sustained phonation, it appears to be insufficient for speech studies. All the previously mentioned studies [2–6] were acquired at substantially faster rates (5–25 fps). In our clinical experience, for various speech samples in normal phonation all closure events are detected at rates around 10 fps [8]; however, some are already missed at 5 fps. A lower frame rate will increase both blurring and the number of missed closure events; it could potentially lead to an incorrect diagnosis of velopharyngeal insufficiency if closure is short and not sampled. Higher frame rates (e.g., 30 fps [9] or more than 100 fps [10]) are achievable and can be required for linguistics and co-articulatory events studies. However, they are obtained using non-cartesian acquisitions, which have been recently developed and are not necessarily available on standard scanners. Furthermore, they usually rely on delayed-reconstruction, which is a limiting factor to conduct an interactive clinical speech study as would be desirable for velopharyngeal closure assessment. Real-time MRI of speech is a very active field of research, and new publications in this area are a welcome addition to the body of knowledge. However, a consensus on best practise for acquisition methodology is still to emerge, and until such time, we would recommend newcomers to err on the side of caution and try to obtain the best spatial-temporal resolution compromise that can be achieved with their chosen acquisition method.

Normal Phonation Research Articles

Related Topics

Articles published on Normal Phonation

The Low Mandible Maneuver: Preliminary Study of Its Effects on Aerodynamic and Acoustic Measures

Assessment of vocal fold mobility using dynamic magnetic resonance imaging and ultrasound in healthy volunteers.

OnabotulinumtoxinA for adductor spasmodic dysphonia (ADSD): Functional results and the role of dosage

Estimation of the glottal flow from speech pressure signals: Evaluation of three variants of iterative adaptive inverse filtering using computational physical modelling of voice production

Relationship between intraglottal geometry, vocal tract constriction, and glottal flow during phonation of a canine larynx

Impact Stress in Water Resistance Voice Therapy: A Physical Modeling Study

Aeroacoustic analysis of the human phonation process based on a hybrid acoustic PIV approach

Contact Quotient of Female Singers Singing Four Pitches for Five Vowels in Normal and Pressed Phonations

Effect of the ventricular folds in a synthetic larynx model

Comparison of Cartesian and Non-Cartesian Real-Time MRI Sequences at 1.5T to Assess Velar Motion and Velopharyngeal Closure during Speech.

Clinical Analysis of Hoarseness in Children as Seen in Otorhinolaryngology Department of a Tertiary Health Institution in North-West, Nigeria

Detection of Voice Pathology using Fractal Dimension in a Multiresolution Analysis of Normal and Disordered Speech Signals.

Modeling the effects of a posterior glottal opening on vocal fold dynamics with implications for vocal hyperfunction.

Evaluating velopharyngeal closure with real-time MRI

Acoustic- and EGG-parametrisations of Phonatory Quality Provide Voice Profiles of Normal Speakers

Spasmodic Dysphonia: A 20‐Year Experience of Injecting Botulinum Toxin

Spectral Analysis of Digital Kymography in Normal Adult Vocal Fold Vibration

Outcome of the Vocal Cord function after Partial Layer Resection of the Recurrent Laryngeal Nerve in Patients with Invasive Papillary Thyroid Cancer

Revisiting the two-mass model of the vocal folds

Nonlinearities in block-type reduced-order vocal fold models with asymmetric tissue properties

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Normal Phonation Research Articles

Related Topics

Articles published on Normal Phonation

The Low Mandible Maneuver: Preliminary Study of Its Effects on Aerodynamic and Acoustic Measures

Assessment of vocal fold mobility using dynamic magnetic resonance imaging and ultrasound in healthy volunteers.

OnabotulinumtoxinA for adductor spasmodic dysphonia (ADSD): Functional results and the role of dosage

Estimation of the glottal flow from speech pressure signals: Evaluation of three variants of iterative adaptive inverse filtering using computational physical modelling of voice production

Relationship between intraglottal geometry, vocal tract constriction, and glottal flow during phonation of a canine larynx

Impact Stress in Water Resistance Voice Therapy: A Physical Modeling Study

Aeroacoustic analysis of the human phonation process based on a hybrid acoustic PIV approach

Contact Quotient of Female Singers Singing Four Pitches for Five Vowels in Normal and Pressed Phonations

Effect of the ventricular folds in a synthetic larynx model

Comparison of Cartesian and Non-Cartesian Real-Time MRI Sequences at 1.5T to Assess Velar Motion and Velopharyngeal Closure during Speech.

Clinical Analysis of Hoarseness in Children as Seen in Otorhinolaryngology Department of a Tertiary Health Institution in North-West, Nigeria

Detection of Voice Pathology using Fractal Dimension in a Multiresolution Analysis of Normal and Disordered Speech Signals.

Modeling the effects of a posterior glottal opening on vocal fold dynamics with implications for vocal hyperfunction.

Evaluating velopharyngeal closure with real-time MRI

Acoustic- and EGG-parametrisations of Phonatory Quality Provide Voice Profiles of Normal Speakers

Spasmodic Dysphonia: A 20‐Year Experience of Injecting Botulinum Toxin

Spectral Analysis of Digital Kymography in Normal Adult Vocal Fold Vibration

Outcome of the Vocal Cord function after Partial Layer Resection of the Recurrent Laryngeal Nerve in Patients with Invasive Papillary Thyroid Cancer

Revisiting the two-mass model of the vocal folds

Nonlinearities in block-type reduced-order vocal fold models with asymmetric tissue properties