Abstract

Although surface electromyograms recorded from high-density electrode arrays are believed to carry rich spatial information that benefits the decoding of motor intentions, the complexity of using such arrays has hindered their widespread application, especially in wearable devices. This study aims to develop a non-acoustic modality of silent speech recognition that transfers knowledge learned from a high-density array to a system using only a few channels, achieving both high portability and high performance. A convolutional neural network was established to recognize a vocabulary of 33 Chinese words during subvocal speech production. In the source domain, the network was trained on data recorded from face and neck muscles using two arrays with 64 channels. It was then calibrated through a transfer learning approach to adapt it to a new target domain, using data recorded by 8 separate electrodes, with the expectation that its ability to characterize subvocal speech word patterns would be preserved. The proposed method significantly outperformed three common classification approaches and a baseline without transfer learning (a network trained only on data from the target domain). It still yielded performance improvements under conditions of electrode shift and cross-user variability. The method is demonstrated to be viable for transfer learning across domains of electrode settings, and it helps improve the performance of silent speech recognition systems that use separate electrode sites, under the guidance of high-density arrays.
