Abstract

Feature extraction is of great importance to ultrasound tongue image analysis. Inspired by the recent success of deep learning, we explore a novel approach to feature extraction from ultrasound tongue images using pre-trained convolutional neural networks (CNNs). The bottleneck features from different pre-trained CNNs, including VGGNet and ResNet, are used as representations of the ultrasound tongue images. Then an image classification task is conducted to assess the effectiveness of the CNN-based features. Our dataset consists of 20,000 ultrasound tongue images collected from a female speaker of Mandarin Chinese, which were manually labeled as containing one of the following consonants: /p, t, k, l/. Experimental results show that the Gradient Boosting Machine (GBM) classifiers trained on the CNN-based features achieve the best performance, with a classification accuracy of 92.4% for ResNet and 91.6% for VGGNet, outperforming the benchmark GBM classifier trained on features extracted using Principal Component Analysis (PCA), which only achieves an accuracy of 87.5%. On this preliminary dataset, our method of feature extraction is found to be superior to the PCA-based method. This work demonstrates the potential of applying pre-trained convolutional neural networks to the ultrasound tongue image analysis task.
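The pipeline described above (pre-trained CNN bottleneck features fed to a GBM classifier, with a PCA baseline) can be sketched as follows. This is a minimal illustration, not the authors' code: the image paths, label encoding of /p, t, k, l/, input size, and the choice of ResNet-50 with ImageNet weights and of scikit-learn's GradientBoostingClassifier are all assumptions made for the example.

```python
# Minimal sketch of the abstract's pipeline: CNN bottleneck features -> GBM,
# compared against a PCA-based feature baseline. All data handling details
# (paths, labels, image size) are hypothetical placeholders.
import numpy as np
import torch
import torchvision.models as models
import torchvision.transforms as T
from PIL import Image
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.decomposition import PCA

# Pre-trained ResNet-50 with the final classification layer removed, so each
# image is mapped to a 2048-dimensional pooled "bottleneck" feature vector.
resnet = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V1)
feature_extractor = torch.nn.Sequential(*list(resnet.children())[:-1]).eval()

preprocess = T.Compose([
    T.Resize((224, 224)),
    T.Grayscale(num_output_channels=3),  # ultrasound frames are single-channel
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

def cnn_features(image_paths):
    """Return an (N, 2048) array of CNN bottleneck features for a list of images."""
    feats = []
    with torch.no_grad():
        for path in image_paths:
            x = preprocess(Image.open(path)).unsqueeze(0)
            feats.append(feature_extractor(x).flatten().numpy())
    return np.vstack(feats)

# Hypothetical usage, assuming `train_paths`, `test_paths` and integer labels
# `y_train`, `y_test` (0..3 for /p, t, k, l/) already exist:
#   X_train, X_test = cnn_features(train_paths), cnn_features(test_paths)
#   gbm = GradientBoostingClassifier().fit(X_train, y_train)
#   cnn_acc = gbm.score(X_test, y_test)
#
# PCA baseline on raw pixel vectors (component count is an arbitrary choice):
#   pca = PCA(n_components=128).fit(raw_train_pixels)
#   gbm_pca = GradientBoostingClassifier().fit(pca.transform(raw_train_pixels), y_train)
#   pca_acc = gbm_pca.score(pca.transform(raw_test_pixels), y_test)
```

The same extractor can be swapped for a VGG network by replacing the backbone and taking the output of its last pooling or fully connected hidden layer; the classifier stage is unchanged.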
