Improved features using convolution-augmented transformers for keyword spotting

Yi Wang, Qiang Chen, Junan Yang, Song Li, Jingtao Liu, L Nguyen

doi:10.1051/itmconf/20224702039

Open Access

Journal: ITM Web of Conferences
Publication Date: Jan 1, 2022
License type: CC BY 4.0

Affiliation: PLA 306 Hospital

#Local Feature #Local Patterns

Abstract

Transformers can effectively model long-range dependencies, but they are less capable of extracting local feature patterns, whereas CNNs exploit local features effectively. In this paper, we investigate whether combining convolution and Transformers improves over using either individually, and we propose improved features using convolution-augmented transformers for keyword spotting. The model is constructed from a ResNet front-end and a convolution-augmented transformer back-end connected in series. The resulting features are then used for the keyword spotting task. The results show that the improved features using convolution-augmented transformers yield at least a 3% improvement compared with other features.
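The series arrangement described in the abstract can be illustrated with a minimal PyTorch sketch. It is not the authors' implementation: the class names (`ResidualBlock`, `ConvAugmentedBlock`, `KeywordSpotter`), the layer sizes, the kernel size, the number of blocks, and the 12-keyword output are all assumptions chosen for illustration, and the back-end block is a simplified convolution-augmented design (self-attention followed by a depthwise convolution), not necessarily the exact block used in the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class ResidualBlock(nn.Module):
    """ResNet-style block over a (time, frequency) map; sizes are assumed."""

    def __init__(self, channels: int):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False),
            nn.BatchNorm2d(channels),
            nn.ReLU(),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1, bias=False),
            nn.BatchNorm2d(channels),
        )

    def forward(self, x):
        # Skip connection around two 3x3 convolutions.
        return torch.relu(x + self.body(x))


class ConvAugmentedBlock(nn.Module):
    """Self-attention (global context) plus depthwise 1-D conv (local context)."""

    def __init__(self, dim: int, heads: int = 4, kernel_size: int = 15):
        super().__init__()
        self.attn_norm = nn.LayerNorm(dim)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.conv_norm = nn.LayerNorm(dim)
        self.depthwise = nn.Conv1d(
            dim, dim, kernel_size, padding=kernel_size // 2, groups=dim
        )
        self.pointwise = nn.Conv1d(dim, dim, kernel_size=1)
        self.ffn_norm = nn.LayerNorm(dim)
        self.ffn = nn.Sequential(
            nn.Linear(dim, 4 * dim), nn.SiLU(), nn.Linear(4 * dim, dim)
        )

    def forward(self, x):  # x: (batch, time, dim)
        h = self.attn_norm(x)
        x = x + self.attn(h, h, h, need_weights=False)[0]
        h = self.conv_norm(x).transpose(1, 2)  # (batch, dim, time)
        h = self.pointwise(F.silu(self.depthwise(h)))
        x = x + h.transpose(1, 2)  # back to (batch, time, dim)
        return x + self.ffn(self.ffn_norm(x))


class KeywordSpotter(nn.Module):
    """ResNet front-end and convolution-augmented back-end in series."""

    def __init__(self, n_mels=40, channels=16, dim=64, n_blocks=2, n_keywords=12):
        super().__init__()
        self.stem = nn.Conv2d(1, channels, kernel_size=3, padding=1)
        self.front_end = nn.Sequential(
            ResidualBlock(channels), ResidualBlock(channels)
        )
        self.project = nn.Linear(channels * n_mels, dim)
        self.back_end = nn.Sequential(
            *[ConvAugmentedBlock(dim) for _ in range(n_blocks)]
        )
        self.classifier = nn.Linear(dim, n_keywords)

    def features(self, spec):  # spec: (batch, time, n_mels)
        x = self.front_end(self.stem(spec.unsqueeze(1)))  # (batch, C, time, n_mels)
        x = x.permute(0, 2, 1, 3).flatten(2)  # (batch, time, C * n_mels)
        return self.back_end(self.project(x))  # (batch, time, dim)

    def forward(self, spec):
        # Average the frame-level features over time, then classify.
        return self.classifier(self.features(spec).mean(dim=1))
```

In this sketch, `features` returns the frame-level representation produced by the two stages in series, and `forward` pools it over time to score each keyword. A batch of two 50-frame, 40-bin spectrograms, `torch.randn(2, 50, 40)`, would produce features of shape `(2, 50, 64)` and logits of shape `(2, 12)` under the assumed defaults.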

Full Text