2018
DOI: 10.1007/s10618-018-0565-y
|View full text |Cite
|
Sign up to set email alerts
|

Optimizing dynamic time warping’s window width for time series data mining applications

Abstract: Dynamic Time Warping (DTW) is a highly competitive distance measure for most time series data mining problems. Obtaining the best performance from DTW requires setting its only parameter, the maximum amount of warping (w). In the supervised case with ample data, w is typically set by cross-validation in the training stage. However, this method is likely to yield suboptimal results for small training sets. For the unsupervised case, learning via cross-validation is not possible because we do not have access to … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
4
3

Relationship

0
7

Authors

Journals

citations
Cited by 73 publications
(26 citation statements)
references
References 57 publications
0
26
0
Order By: Relevance
“…Here, we used biological insights from a previous study to set ε at a spatial scale reflecting that at which GPS-tracked gannets typically forage, but one could set ε to reflect known location error from ones tracking device for example. For DTW, LCSS and EDR one must also set a δ value and while many studies adopt an unconstrained approach as we do here adjusting this parameter can sometimes improve clustering performance (Dau et al 2018). We also note that the similarity measures covered here represent only a subset of available trajectory similarity measures (Ranacher and Tzavella 2014) and that rather than having to choose between similarity measures it may be possible to use them as an ensemble for machine learning methods of time-series classification purposes (Lines and Bagnall 2015).…”
Section: Discussionmentioning
confidence: 99%
“…Here, we used biological insights from a previous study to set ε at a spatial scale reflecting that at which GPS-tracked gannets typically forage, but one could set ε to reflect known location error from ones tracking device for example. For DTW, LCSS and EDR one must also set a δ value and while many studies adopt an unconstrained approach as we do here adjusting this parameter can sometimes improve clustering performance (Dau et al 2018). We also note that the similarity measures covered here represent only a subset of available trajectory similarity measures (Ranacher and Tzavella 2014) and that rather than having to choose between similarity measures it may be possible to use them as an ensemble for machine learning methods of time-series classification purposes (Lines and Bagnall 2015).…”
Section: Discussionmentioning
confidence: 99%
“…Also in this case we perform zero padding to fit all the time-series lengths to the size of the longest one. -The Dynamic Time Warping measures [7] (DTW ) coupled with K-means algorithm. Such distance measure is especially tailored for time-series data with variable length-size.…”
Section: Competitors and Methods Ablationsmentioning
confidence: 99%
“…The bandwidth r defines the constraint range of the matching path in the distance matrix and suppresses the influence of undesired convergence in the matching path [52]. Because there was a correlation between the defined warping offset distance and the SDTW algorithm, as well as the SDTW-based distance and the constraint bandwidth r, different r not only affected the optimal matching path of the SDTW but also led to the change of d similarity .…”
Section: Similarity Measure Evaluation With One Nearest Neighbor (1-nmentioning
confidence: 99%
“…For case II, it can be considered that the constraint bandwidth did not affect the distance measured by the SDTW algorithm, and the first r corresponding to the minimum can be seen as the candidate. For the situation in case III that multiple candidate values within the convergence region corresponded to the same minimum value E SUM , the median of these candidate values was selected as r. Here, the general rules for determining and adjusting the preset range for r can refer to [52].…”
Section: Similarity Measure Evaluation With One Nearest Neighbor (1-nmentioning
confidence: 99%