Automatic Multiple Articulator Segmentation in Dynamic Speech MRI Using a Protocol Adaptive Stacked Transfer Learning U-NET Model.

Subin Erattakulangara,Sarv Priya,Karthika Kelat,Sajan Goud Lingala,David Meyer

doi:10.3390/bioengineering10050623

Subin Erattakulangara, Sarv Priya + Show 3 more

Open Access

https://doi.org/10.3390/bioengineering10050623

Copy DOI

Journal: Bioengineering	Publication Date: May 22, 2023
Citations: 2	License type: CC BY 4.0

Affiliation: University of Iowa, Shenandoah University

Abstract

Dynamic magnetic resonance imaging has emerged as a powerful modality for investigating upper-airway function during speech production. Analyzing the changes in the vocal tract airspace, including the position of soft-tissue articulators (e.g., the tongue and velum), enhances our understanding of speech production. The advent of various fast speech MRI protocols based on sparse sampling and constrained reconstruction has led to the creation of dynamic speech MRI datasets on the order of 80-100 image frames/second. In this paper, we propose a stacked transfer learning U-NET model to segment the deforming vocal tract in 2D mid-sagittal slices of dynamic speech MRI. Our approach leverages (a) low- and mid-level features and (b) high-level features. The low- and mid-level features are derived from models pre-trained on labeled open-source brain tumor MR and lung CT datasets, and an in-house airway labeled dataset. The high-level features are derived from labeled protocol-specific MR images. The applicability of our approach to segmenting dynamic datasets is demonstrated in data acquired from three fast speech MRI protocols: Protocol 1: 3 T-based radial acquisition scheme coupled with a non-linear temporal regularizer, where speakers were producing French speech tokens; Protocol 2: 1.5 T-based uniform density spiral acquisition scheme coupled with a temporal finite difference (FD) sparsity regularization, where speakers were producing fluent speech tokens in English, and Protocol 3: 3 T-based variable density spiral acquisition scheme coupled with manifold regularization, where speakers were producing various speech tokens from the International Phonetic Alphabetic (IPA). Segments from our approach were compared to those from an expert human user (a vocologist), and the conventional U-NET model without transfer learning. Segmentations from a second expert human user (a radiologist) were used as ground truth. Evaluations were performed using the quantitative DICE similarity metric, the Hausdorff distance metric, and segmentation count metric. This approach was successfully adapted to different speech MRI protocols with only a handful of protocol-specific images (e.g., of the order of 20 images), and provided accurate segmentations similar to those of an expert human.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Automatic Multiple Articulator Segmentation in Dynamic Speech MRI Using a Protocol Adaptive Stacked Transfer Learning U-NET Model.

Abstract

Talk to us

Similar Papers

More From: Bioengineering

Lead the way for us

Similar Papers

Prospectively accelerated dynamic speech magnetic resonance imaging at 3 T using a self-navigated spiral-based manifold regularized scheme.
Rushdi Zahid Rusho ... Wahidul Alam
NMR in biomedicine | VOL. 37
Rushdi Zahid Rusho, et. al.Rushdi Zahid Rusho ... Wahidul Alam
05 Mar 2024
NMR in biomedicine | VOL. 37

A spine segmentation method based on scene aware fusion network
Elzat Elham Yilizati-Yilihamu ... Shiqing Feng
BMC Neuroscience | VOL. 24
Elzat Elham Yilizati-Yilihamu, et. al.Elzat Elham Yilizati-Yilihamu ... Shiqing Feng
14 Sep 2023
BMC Neuroscience | VOL. 24

Comparison of a fast 5-min knee MRI protocol with a standard knee MRI protocol: a multi-institutional multi-reader study.
Erin Fitzgerald Alaia ... I-Yuan Joseph Chang
Skeletal Radiology | VOL. 47
Erin Fitzgerald Alaia, et. al.Erin Fitzgerald Alaia ... I-Yuan Joseph Chang
26 Sep 2017
Skeletal Radiology | VOL. 47

A multi-level feature fusion method based on pooling and similarity for HRRS image retrieval
Yun Ge ... Famao Ye
Remote Sensing Letters | VOL. 12
Yun Ge, et. al.Yun Ge ... Famao Ye
24 Aug 2021
Remote Sensing Letters | VOL. 12

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Automatic Multiple Articulator Segmentation in Dynamic Speech MRI Using a Protocol Adaptive Stacked Transfer Learning U-NET Model.

Abstract

Talk to us

Similar Papers

More From: Bioengineering