Comparison of DCT and autoencoder-based features for DNN-HMM multimodal silent speech recognition

Licheng Liu, Bruce Denby, Yan Ji, Hongcui Wang

doi:10.1109/iscslp.2016.7918434

Abstract

Hidden Markov Model (HMM) and Deep Neural Network-Hidden Markov Model (DNN-HMM) speech recognition performance for a portable ultrasound + video multimodal silent speech interface is investigated using Discrete Cosine Transform (DCT) and Deep Auto Encoder (DAE)-based features with a range of dimensionalities. Experimental results show that the two types of features achieve similar Word Error Rates, but that the autoencoder features maintain good performance even for very low-dimensional feature vectors, demonstrating their potential as a very compact representation of the information in multimodal silent speech data. It is also shown for the first time that the DNN-HMM approach, which has been demonstrated to be beneficial for acoustic speech recognition and for articulatory sensor-based silent speech recognition, improves performance for video-based silent speech recognition as well.
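
To make the idea of DCT features "with a range of dimensionalities" concrete, the following is a minimal sketch of one common way to turn an image frame into a fixed-length DCT feature vector: take the 2D DCT and keep the lowest-frequency coefficients. This is an illustration only, not the authors' pipeline; the function names, the diagonal ordering of coefficients, and the toy 4x4 frame are all assumptions, and the abstract does not state how coefficients were selected.

```python
import math


def dct_1d(x):
    """Orthonormal DCT-II of a 1D sequence (pure Python, O(n^2))."""
    n = len(x)
    out = []
    for k in range(n):
        s = sum(x[i] * math.cos(math.pi * (i + 0.5) * k / n) for i in range(n))
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * s)
    return out


def dct_2d(image):
    """Separable 2D DCT-II: transform every row, then every column."""
    rows = [dct_1d(r) for r in image]
    cols = [dct_1d(list(c)) for c in zip(*rows)]
    # cols is indexed [column][row]; transpose back to [row][column].
    return [list(r) for r in zip(*cols)]


def low_frequency_features(image, n_features):
    """Keep the n_features lowest-frequency DCT coefficients.

    Assumption: coefficients are ordered by anti-diagonal (u + v),
    a simple stand-in for a zigzag scan.
    """
    coeffs = dct_2d(image)
    h, w = len(coeffs), len(coeffs[0])
    order = sorted(
        ((u, v) for u in range(h) for v in range(w)),
        key=lambda p: (p[0] + p[1], p[0]),
    )
    return [coeffs[u][v] for u, v in order[:n_features]]


# Hypothetical toy frame: a 4x4 grayscale patch with a horizontal gradient.
frame = [[float(col) for col in range(4)] for _ in range(4)]
features = low_frequency_features(frame, 3)
```

Changing `n_features` changes the dimensionality of the feature vector, which is the quantity varied in the comparison described in the abstract.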

Similar Papers

Silent versus Modal Multi-Speaker Speech Recognition from Ultrasound and Video
Manuel Sam Ribeiro ... Korin Richmond
30 Aug 2021

A hybrid speech recognition model based on HMM and fuzzy PPM
P Bao ... A Sim
11 Oct 1998

Pattern Recognition of Rotating Machinery Fault Signals Using Hidden Markov Models
...
Transactions of the Korean Society of Mechanical Engineers A | VOL. 27
01 Nov 2003

Ecofriendly and high-performance flexible pressure sensor derived from natural plant materials for intelligent audible and silent speech recognition
Xuqi Zheng ... Minghui Cao
Nano Energy | VOL. 126
06 May 2024