The Novel Sequence Distance Measuring Algorithm Based on Optimal Transport and Cross-Attention Mechanism

Yanmin Yu,Yongcai Lai,Ping Yan,Haiying Liu

doi:10.1155/2021/3272119

Yanmin Yu, Yongcai Lai + Show 2 more

Open Access

https://doi.org/10.1155/2021/3272119

Copy DOI

Abstract

In this paper, we propose a novel sequence distance measuring algorithm based on optimal transport (OT) and cross-attention mechanism. Given a source sequence and a target sequence, we first calculate the ground distance between each pair of source and target terms of the two sequences. The ground distance is calculated over the subsequences around the two terms. We firstly pay attention from each the source terms to each target terms with attention weights, so that we have a representative source subsequence vector regarding each term in the target subsequence. Then, we pay attention from each representative vector of the term of the target subsequence to the entire source subsequence. In this way, we construct the cross-attention weights and use them to calculate the pairwise ground distances. With the ground distances, we derive the OT distance between the two sequences and train the attention parameters and ground distance metric parameters together. The training process is conducted with training triplets of sequences, where each triplet is composed of an anchor sequence, a must-link sequence, and a cannot-link sequence. The corresponding hinge loss function of each triplet is minimized, and we develop an iterative algorithm to solve the optimal transport problem and the attention/ground distance metric parameters in an alternate way. The experiments over sequence similarity search benchmark datasets, including text, video, and rice smut protein sequence data, are conducted. The experimental results show the algorithm is effective.

Highlights

In natural language processing, a sentence is a sequence of words, and in computer vision, a video is a sequence of frames, while in bioinformatics, a protein structure is a sequence of amino acids in a polypeptide chain
Unlike the flat vector data of most machine learning problems, sequence data has the following inherent features: (1) Sequence data is varying at the number of items. e flat feature is usually given at a fixed size, while the length of the sequences could be different, due to the sampling process to form the sequence
We study the problem of learning effective ground distance between the items of the two sequences for the purpose of sequence distance comparison

Summary

Introduction

Unlike the flat vector data of most machine learning problems, sequence data has the following inherent features:. (1) Sequence data is varying at the number of items. E flat feature is usually given at a fixed size, while the length of the sequences could be different, due to the sampling process to form the sequence. (2) Sequence data has a temporal and relational nature. E order of the items in the sequence plays an important role in the understanding of the sequence. Given two sequences of the same items but with different orders, their meaning could be completely different. Is is a critical, different nature different from the flat vector data, where the items of the vector are considered to be independent of each other and their orders are not important for the learning problem Given two sequences of the same items but with different orders, their meaning could be completely different. is is a critical, different nature different from the flat vector data, where the items of the vector are considered to be independent of each other and their orders are not important for the learning problem

Methods

Results

Conclusion

Talk to us

Join us for a 30 min session where you can share your feedback and ask us any queries you have

Schedule a call

Journal: Shock and Vibration	Publication Date: Aug 31, 2021
Citations: 1	License type: CC BY 4.0

R Discovery Prime

R Discovery Prime

The Novel Sequence Distance Measuring Algorithm Based on Optimal Transport and Cross-Attention Mechanism

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Shock and Vibration

Lead the way for us

Similar Papers

A novel joint extraction model based on cross-attention mechanism and global pointer using context shield window
Zhengwei Zhai ... Lei Zhang
Computer Speech & Language | VOL. 87
Zhengwei Zhai, et. al.Zhengwei Zhai ... Lei Zhang
18 Mar 2024
Computer Speech & Language | VOL. 87

Mechanism design via optimal transport
Constantinos Daskalakis ... Alan Deckelbaum
-
Constantinos Daskalakis, et. al.Constantinos Daskalakis ... Alan Deckelbaum
16 Jun 2013
16 Jun 2013

Chain Rule Optimal Transport
Frank Nielsen ... Ke Sun
-
Frank Nielsen, et. al.Frank Nielsen ... Ke Sun
01 Jan 2020
01 Jan 2020

Twins transformer: rolling bearing fault diagnosis based on cross-attention fusion of time and frequency domain features
Zhikang Gao ... Jiachi Yao
Measurement Science and Technology | VOL. 35
Zhikang Gao, et. al.Zhikang Gao ... Jiachi Yao
13 Jun 2024
Measurement Science and Technology | VOL. 35

Editage

Paperpal

R Discovery

Mind the Graph

R Discovery Prime

R Discovery Prime

The Novel Sequence Distance Measuring Algorithm Based on Optimal Transport and Cross-Attention Mechanism

Abstract

Highlights

Summary

Talk to us

Similar Papers

More From: Shock and Vibration