Doppler Radar-Based Human Speech Recognition Using Mobile Vision Transformer

Wei Li,Qining Ding,Dandan Li,Nanqi Liu,Yang Gao,Yongfu Geng,Jinheng Chen

doi:10.3390/electronics12132874

Wei Li, Qining Ding + Show 5 more

Open Access

https://doi.org/10.3390/electronics12132874

Copy DOI

Journal: Electronics	Publication Date: Jun 29, 2023
Citations: 2	License type: CC BY 4.0

Affiliation: North China University of Technology

Abstract

As one of the important vital features of the human body, the acquisition of a speech signal plays an important role in human–computer interaction. In this study, voice sounds are gathered and identified using Doppler radar. The skin on the neck vibrates when a person speaks, which causes the vocal cords to vibrate as well. The vibration signal received by the radar will produce a unique micro-Doppler signal according to words with different pronunciations. Following the conversion of these signals into micro-Doppler feature maps, these speech signal maps are categorized and identified. The speech recognition method used in this paper is on neural networks. CNN convolutional neural networks have a lower generalization and accuracy when there are insufficient training samples and sample extraction bias, and the training model is not suitable for use on mobile terminals. MobileViT is a lightweight transformers-based model that can be used for image classification tasks. MobileViT uses a lightweight attention mechanism to extract features with a faster inference speed and smaller model size while ensuring a higher accuracy. Our proposed method does not require large-scale data collection, which is beneficial for different users. In addition, the learning speed is relatively fast, with an accuracy of 99.5%.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Doppler Radar-Based Human Speech Recognition Using Mobile Vision Transformer

Abstract

Talk to us

Similar Papers

More From: Electronics

Lead the way for us

Similar Papers

Model-Efficient TTS
Xu Tan
-
Xu TanXu Tan
01 Jan 2023
01 Jan 2023

A Latent Variable Augmentation Method for Image Categorization with Insufficient Training Samples
Luyue Lin ... Bo Liu
ACM Transactions on Knowledge Discovery from Data | VOL. 16
Luyue Lin, et. al.Luyue Lin ... Bo Liu
20 Jul 2021
ACM Transactions on Knowledge Discovery from Data | VOL. 16

Towards Accurate DGA Detection based on Siamese Network with Insufficient Training Samples
Xiaoyan Hu ... Miao Li
-
Xiaoyan Hu, et. al.Xiaoyan Hu ... Miao Li
16 May 2022
16 May 2022

System and method for speech recognition
John Marley
The Journal of the Acoustical Society of America | VOL. 74
John MarleyJohn Marley
01 Aug 1983
The Journal of the Acoustical Society of America | VOL. 74

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Doppler Radar-Based Human Speech Recognition Using Mobile Vision Transformer

Abstract

Talk to us

Similar Papers

More From: Electronics