Learning a Mahalanobis Distance-Based Dynamic Time Warping Measure for Multivariate Time Series Classification

Mei, Jiangyuan; Liu, Meizhu; Wang, Yuan-Fang; Gao, Huijun

doi:10.1109/tcyb.2015.2426723

Cited by 144 publications

(70 citation statements)

References 29 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…including our approach achieve up to 17% improvement in FPR relative to DTW [42], MDDTW [25], CTW [47] and GDTW [48]. Moreover, our approach further improves the results up to 6% and 0.8% for CMU mocap and Human3.6m datasets, respectively, against the state-of-the-art deep learning approaches [33,41,45,35].…”

Section: Action Recognitionmentioning

confidence: 81%

“…We compare our MMD-NCA loss against the methods from DTW [42], MDDTW [25], CTW [47] and GDTW [48], as well as four state-of-the-art deep metric learning approaches: DCTW [41], triplet [33], triplet+GOR [45], and the N -Pairs deep metric loss [14]. Primarily, these methods are evaluated through action recognition task in Sec.…”

Section: Resultsmentioning

confidence: 99%

“…Here, all deep metric learning approaches including our work significantly improve the accuracy against the DTW, MDDTW, CTW and GDTW. Overall, our method outperforms all the approaches for all FPR with a 20% improvement against DTW [42], MDDTW [25], CTW [47] and GDTW [48], and a 2% improvement compared to the state-of-the-art deep learning approaches [33,41,45,35]. Moreover, when we evaluate the NMI and the F 1 score for the clustering quality in different embedding sizes, Fig.…”

Section: Person Identificationmentioning

confidence: 86%

See 2 more Smart Citations

Human Motion Analysis with Deep Metric Learning

Coskun

Tan

Conjeti

et al. 2018

Lecture Notes in Computer Science

View full text Add to dashboard Cite

Effectively measuring the similarity between two human motions is necessary for several computer vision tasks such as gait analysis, person identification and action retrieval. Nevertheless, we believe that traditional approaches such as L2 distance or Dynamic Time Warping based on hand-crafted local pose metrics fail to appropriately capture the semantic relationship across motions and, as such, are not suitable for being employed as metrics within these tasks. This work addresses this limitation by means of a triplet-based deep metric learning specifically tailored to deal with human motion data, in particular with the problem of varying input size and computationally expensive hard negative mining due to motion pair alignment. Specifically, we propose (1) a novel metric learning objective based on a triplet architecture and Maximum Mean Discrepancy; as well as, (2) a novel deep architecture based on attentive recurrent neural networks. One benefit of our objective function is that it enforces a better separation within the learned embedding space of the different motion categories by means of the associated distribution moments. At the same time, our attentive recurrent neural network allows processing varying input sizes to a fixed size of embedding while learning to focus on those motion parts that are semantically distinctive. Our experiments on two different datasets demonstrate significant improvements over conventional human motion metrics.

show abstract

Section: Action Recognitionmentioning

confidence: 81%

Section: Resultsmentioning

confidence: 99%

Section: Person Identificationmentioning

confidence: 86%

See 1 more Smart Citation

Human Motion Analysis with Deep Metric Learning

Coskun

Tan

Conjeti

et al. 2018

Lecture Notes in Computer Science

View full text Add to dashboard Cite

show abstract

“…A natural next step is to likewise "smooth out" the distance calculation in the temporal domain. We borrow the approach of Mei et al, 14 which extends the Dynamic Time Warping (DTW) algorithm-leveraged in Section II.A.1 to compute the distance between single-variable WITI factor patterns-to multi-dimensional data. In single-variable DTW, the optimal degree of local stretching and/or compression in the time domain-subject to various user-defined constraints such as the width parameter w-is based upon comparing scalar distances of the form (x t − y τ ) 2 between two time-series x and y at potentially different times t and τ such that |t − τ | ≤ w. In the multi-variable case, the distance between the two time-series at times t and τ can, for instance, be measured via the Euclidean distance (x t − y τ ) (x − y).…”

Section: Iib1 Correlating Spatio-temporal Delay Patterns To Tmi Stmentioning

confidence: 99%

Association Rules for Traffic Flow Management Decision Support

Vargo

Taylor

Wan

2016

16th AIAA Aviation Technology, Integration, and Operations Conference

View full text Add to dashboard Cite

“…In this work, Wavelet transform is chosen to design a feature extraction method for noisy environment because it captures time-frequency information of transient signal and gives multi resolution of time-frequency information. Traditional DTW deals with univariate time series(UTS) [3] which may not capture the similarities of all the dimension of the feature vector and the similarity check of two univariate time series sequence would not reflect correctly [19]. Moreover feature vectors are multi-dimensional, so there is a necessity for Multivariate Time Series (MTS) based DTW.…”

Section: Introductionmentioning

confidence: 99%

Modified Multivariate Euclidean Dynamic Time Warping Based Spoken Keyword Detection

Alex¹,

Nithya²

2017

IJIES

View full text Add to dashboard Cite

Traditional Dynamic Time Warping (DTW) technique find similarities between two one-dimensional time series sequence. Initially, in earlier decades, DTW was not preferred because of its computational complexity. However, due to the evolution of computing power, this has been revisited for spoken keyword detection recently. Conventional spectral features such as Mel-Frequency Cepstral Coefficients (MFCC) and contemporary wavelet features are multi-dimensional in nature which are used in speech recognition. In this work, a new strategy of DTW is proposed to work with multi-dimensional feature vector in calculating the local distance matrix. Additionally, a faster approach is specified to find the similarity in the global distance matrix. The proposed methods are evaluated with MFCC and wavelet features on a connected TIDIGITS corpus for spoken keyword detection system. Experimental results prove that there is an improvement in reduction of computational complexity compared to traditional DTW. Also, contemporary wavelet feature based spoken keyword detection system gave better detection accuracy than MFCC based spoken keyword detection system in the noisy environment.

show abstract

Learning a Mahalanobis Distance-Based Dynamic Time Warping Measure for Multivariate Time Series Classification

Cited by 144 publications

References 29 publications

Human Motion Analysis with Deep Metric Learning

Human Motion Analysis with Deep Metric Learning

Association Rules for Traffic Flow Management Decision Support

Modified Multivariate Euclidean Dynamic Time Warping Based Spoken Keyword Detection

Contact Info

Product

Resources

About