2000
DOI: 10.1007/3-540-45571-x_14
|View full text |Cite
|
Sign up to set email alerts
|

A Simple Dimensionality Reduction Technique for Fast Similarity Search in Large Time Series Databases

Abstract: Abstract.We address the problem of similarity search in large time series databases. We introduce a novel-dimensionality reduction technique that supports an indexing algorithm that is more than an order of magnitude faster than the previous best known method. In addition to being much faster our approach has numerous other advantages. It is simple to understand and implement, allows more flexible distance measures including weighted Euclidean queries and the index can be built in linear time. We call our appr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
107
0
2

Year Published

2002
2002
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 178 publications
(109 citation statements)
references
References 16 publications
0
107
0
2
Order By: Relevance
“…Agrawal et al [17] map time series into frequency domain by the discrete Fourier transform and only retain the first few frequencies. Keogh and Pazzani [18] reduce the dimension of time series data by segmenting the time series into sections and indexing only the section means. Agarwal et al [19] index market basket data by a specific signature table, which easens the similarity search.…”
Section: Related Workmentioning
confidence: 99%
“…Agrawal et al [17] map time series into frequency domain by the discrete Fourier transform and only retain the first few frequencies. Keogh and Pazzani [18] reduce the dimension of time series data by segmenting the time series into sections and indexing only the section means. Agarwal et al [19] index market basket data by a specific signature table, which easens the similarity search.…”
Section: Related Workmentioning
confidence: 99%
“…The reduction of dimensionality in the x-axis is obtained by dividing the total length of the time series into fragments of a certain size (word size). It is also necessary to establish a number of intervals in the y-axis to compress the values of the time series (size of the alphabet) (Keogh & Pazzani, 2000). In this case, the parameters of word size and alphabet were selected experimentally.…”
Section: Time Series Extractionmentioning
confidence: 99%
“…This nonlinear alignment results in a more sophisticated distance measure as shown in Figure 2(B). There are numerous papers on using DTW for time series data, including [6] [7][8] [9][10]. Berndt and Clifford [6] describe a DTW algorithm as follows:…”
Section: Dynamic Time Warpingmentioning
confidence: 99%
“…As a result, we explored techniques for pattern matching and pattern discovery in both continuous and discrete events. For pattern matching in continuous data, we studied the use of Dynamic Time Warping (DTW) [6] [7][8][9] [10] or Hidden Markov Models. We had a limited set of analog data to work with for this aspect of the project and were not able to proceed beyond the initial investigatory steps.…”
Section: Summary and Future Workmentioning
confidence: 99%
See 1 more Smart Citation