Acoustic Word Embedding Based on Multi-Head Attention Quadruplet Network

Shirong Zhu,Kai He,Lasheng Zhao,Ying Zhang

doi:10.1109/lsp.2021.3129702

Shirong Zhu, Kai He + Show 2 more

Open Access

https://doi.org/10.1109/lsp.2021.3129702

Copy DOI

Journal: IEEE Signal Processing Letters	Publication Date: Jan 1, 2022
Citations: 1	License type: publisher-specific, author manuscript

Affiliation: Dalian University

Abstract

Acoustic word embedding (AWE) has become a mainstream method in low-resource Query-by-Example keywords search. This paper proposes an AWE based on a multi-head attention quadruplet network, which can learn the attention weight sequence for all time frames of bidirectional Long Short-Term Memory by a multi-head self-attentive mechanism to pay attention to the time position information. At the same time, we construct a differences order quadruplet loss to train the AWE model to adequately consider the relative and absolute distances between the positive and negative sample pairs. In addition, attention mechanism, differences order quadruplet loss, and word label information are combined to design an objective function so that the AWE vectors have a better feature expression in the embedded space. The experimental results show that the proposed method can improve the learning ability of the network and make the AWEs more identifiable. The above two points result in better performance in the word discrimination task.

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

R Discovery Prime

R Discovery Prime

Acoustic Word Embedding Based on Multi-Head Attention Quadruplet Network

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters

Lead the way for us

Similar Papers

Self-supervised dual-head attentional bootstrap learning network for prostate cancer screening in transrectal ultrasound images
Xu Lu ... Shaopeng Liu
Computers in Biology and Medicine | VOL. 165
Xu Lu, et. al.Xu Lu ... Shaopeng Liu
12 Aug 2023
Computers in Biology and Medicine | VOL. 165

Diffusion tensor tractography measurement of the distance between corticospinal tracts in patients with spontaneous intraventricular haemorrhage.
Sung Ho Jang ... Han Do Lee
The Journal of international medical research | VOL. 44
Sung Ho Jang, et. al.Sung Ho Jang ... Han Do Lee
07 Dec 2015
The Journal of international medical research | VOL. 44

Perception of absolute and relative distances in stereoscopic image
Kazunori Shidoji ... Masahiko Ogawa
-
Kazunori Shidoji, et. al.Kazunori Shidoji ... Masahiko Ogawa
04 Feb 2010
04 Feb 2010

Solving Inefficiency of Self-supervised Representation Learning
Guangrun Wang ... Philip H.S Torr
-
Guangrun Wang, et. al.Guangrun Wang ... Philip H.S Torr
01 Oct 2021
01 Oct 2021

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

Acoustic Word Embedding Based on Multi-Head Attention Quadruplet Network

Abstract

Talk to us

Similar Papers

More From: IEEE Signal Processing Letters