Estimating Network Flowing over Edges by Recursive Network Embedding

Yu, Liangli; Wang, Hongqi; Mo, Haoran

doi:10.1155/2020/8893381

Cited by 1 publication

(1 citation statement)

References 16 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…e most popular way to handle sequence data is to map a sequence to a flat vector and then apply the conventional methods. However, this methodology usually cannot capture the sequential feature of the data; thus, the results are not satisfying [4,[7][8][9][10][11][12].Comparing the similarity/ dissimilarity of a pair of sequences is a fundamental problem of sequence data analysis and understanding. e applications include the similarity search [13][14][15][16] and nearest neighbor-based classification [17,18].…”

Section: Introductionmentioning

confidence: 99%

The Novel Sequence Distance Measuring Algorithm Based on Optimal Transport and Cross‐Attention Mechanism

Lai

Yan³

et al. 2021

Shock and Vibration

View full text Add to dashboard Cite

In this paper, we propose a novel sequence distance measuring algorithm based on optimal transport (OT) and cross-attention mechanism. Given a source sequence and a target sequence, we first calculate the ground distance between each pair of source and target terms of the two sequences. The ground distance is calculated over the subsequences around the two terms. We firstly pay attention from each the source terms to each target terms with attention weights, so that we have a representative source subsequence vector regarding each term in the target subsequence. Then, we pay attention from each representative vector of the term of the target subsequence to the entire source subsequence. In this way, we construct the cross-attention weights and use them to calculate the pairwise ground distances. With the ground distances, we derive the OT distance between the two sequences and train the attention parameters and ground distance metric parameters together. The training process is conducted with training triplets of sequences, where each triplet is composed of an anchor sequence, a must-link sequence, and a cannot-link sequence. The corresponding hinge loss function of each triplet is minimized, and we develop an iterative algorithm to solve the optimal transport problem and the attention/ground distance metric parameters in an alternate way. The experiments over sequence similarity search benchmark datasets, including text, video, and rice smut protein sequence data, are conducted. The experimental results show the algorithm is effective.

show abstract