Abstract

To advance the study of lip reading in accordance with Chinese pronunciation norms, we investigated Mandarin tone recognition based on visual information, in contrast to previous character-based Chinese lip-reading techniques. In this paper, we mainly studied the vowel tonal transformations in Chinese pronunciation and designed a lightweight skipping convolution network framework (SCNet). The experimental results showed that the SCNet was more sensitive to detailed descriptions of pitch change than traditional models and achieved better tone recognition and outstanding anti-interference performance. In addition, we conducted a more detailed study of the role of deep texture information in lip-reading recognition. We found that deep texture information has a significant effect on tone recognition, confirming the feasibility of multimodal lip reading for Chinese tone recognition. Similarly, we verified the performance of the SCNet on syllable tone recognition and found that the vowel and syllable tone recognition accuracy of our model reached 97.3%, demonstrating the robustness of the proposed method and its broad applicability to Chinese tone recognition.

Highlights

  • The superior performance of lip reading in robust speech recognition has received widespread attention. The goal of lip reading is to improve the robustness of speech recognition in special situations such as low signal-to-noise ratio (SNR) or silent environments

  • We focus on the study of the vowel tonal changes in Chinese pronunciation

  • (1) For Chinese pronunciation tonal changes, we propose a new lightweight network framework, the skipping convolution network framework (SCNet), which is more sensitive to the transformation of details compared with the traditional network architecture
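The paper does not publish SCNet's implementation, but the core idea named in the highlight (a convolution path combined with a skipped, unmodified path so that fine detail changes are preserved) can be illustrated with a minimal pure-Python sketch. The function names `conv1d_same` and `skip_conv_block` are hypothetical, not from the paper, and a 1-D signal stands in for the real video features.

```python
def conv1d_same(x, kernel):
    """1-D correlation with zero padding so the output length matches the input."""
    pad = len(kernel) // 2
    xp = [0.0] * pad + list(x) + [0.0] * pad
    return [sum(xp[i + j] * kernel[j] for j in range(len(kernel)))
            for i in range(len(x))]

def skip_conv_block(x, kernel):
    """Convolve the input, then add the unmodified input back via the skip path,
    so small local variations in x survive the smoothing of the convolution."""
    y = conv1d_same(x, kernel)
    return [a + b for a, b in zip(y, x)]

signal = [0.0, 1.0, 2.0, 1.0, 0.0]          # toy stand-in for a pitch-related feature track
out = skip_conv_block(signal, [0.25, 0.5, 0.25])
# → [0.25, 2.0, 3.5, 2.0, 0.25]
```

The skip path is what makes such a block sensitive to detail: the convolution alone smooths the input, while adding the raw input back keeps its local transitions in the output.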


Summary

Introduction

The superior performance of lip reading in robust speech recognition has received widespread attention. The goal of lip reading is to improve the robustness of speech recognition in special situations such as low signal-to-noise ratio (SNR) or silent environments. Pixel-based methods extract visual features from the image directly or after some preprocessing and transformation. Model-based methods use low-dimensional features to express image features, and these features are typically invariant to factors such as translation, rotation, scaling, or illumination. Both methods extract relevant information directly from the region of interest (ROI) in the planar image [8]. Wang et al. [13] used 3D lip points obtained from Kinect, improving the performance of multimodal speech recognition. Studies by these pioneers have demonstrated the effectiveness of depth information in lip-reading recognition. However, currently proposed lip-reading recognition based on 3D depth information does not consider the inherent texture that drives lip motion during natural speech.
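Both pixel-based and model-based methods start by cropping the lip ROI from each frame. The paper does not give its cropping procedure; the sketch below is a generic illustration, assuming a hypothetical landmark detector has already returned (x, y) lip contour points for a grayscale frame.

```python
def lip_roi(frame, landmarks, margin=2):
    """Crop a rectangular region of interest around lip landmarks.

    frame     : 2-D grayscale image as a list of pixel rows
    landmarks : (x, y) lip contour points from a hypothetical detector
    margin    : extra pixels kept around the landmark bounding box
    """
    xs = [p[0] for p in landmarks]
    ys = [p[1] for p in landmarks]
    # Bounding box of the landmarks, expanded by the margin and clipped
    # to the frame borders.
    x0 = max(min(xs) - margin, 0)
    y0 = max(min(ys) - margin, 0)
    x1 = min(max(xs) + margin + 1, len(frame[0]))
    y1 = min(max(ys) + margin + 1, len(frame))
    return [row[x0:x1] for row in frame[y0:y1]]
```

A pixel-based method would feed this cropped patch (raw or lightly transformed) to the recognizer, while a model-based method would instead fit a low-dimensional shape or appearance model to it.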

Data Collection and Feature Preprocessing
Feature Preprocessing
Network Architecture
Experiments and Results
Method
