2009 Ninth IEEE International Conference on Data Mining 2009
DOI: 10.1109/icdm.2009.15
|View full text |Cite
|
Sign up to set email alerts
|

Finding Time Series Motifs in Disk-Resident Data

Abstract: Abstract-Time series motifs are sets of very similar subsequences of a long time series. They are of interest in their own right, and are also used as inputs in several higher-level data mining algorithms including classification, clustering, rule-discovery and summarization. In spite of extensive research in recent years, finding exact time series motifs in massive databases is an open problem. Previous efforts either found approximate motifs or considered relatively small datasets residing in main memory. In… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
41
0

Year Published

2010
2010
2019
2019

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 38 publications
(41 citation statements)
references
References 26 publications
(59 reference statements)
0
41
0
Order By: Relevance
“…Subsequence similarity search, the task of finding a region of much longer time series that matches a specified query time series within a given threshold, is a fundamental subroutine in many higher level data mining tasks such as motif discovery [19], anomaly detection [4], association discovery, and classification [20][1] [33].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Subsequence similarity search, the task of finding a region of much longer time series that matches a specified query time series within a given threshold, is a fundamental subroutine in many higher level data mining tasks such as motif discovery [19], anomaly detection [4], association discovery, and classification [20][1] [33].…”
Section: Introductionmentioning
confidence: 99%
“…To consider one concrete example, time series motif discovery is a useful tool with applications in dozens of domains. A recent paper introduced a technique to find motifs in datasets containing millions of objects in just hours, a significant speed-up [19]. This method explicitly assumes the Euclidean Distance; however, for the related problem of classification, it is wellknown that DTW is significantly more accurate [7][25] [33].…”
Section: Introductionmentioning
confidence: 99%
“…The approach of [9] is the first tractable exact motif discovery algorithm based on the combination of early abandoning the Euclidean distance calculation and a heuristic search guided by the linear ordering of data. The authors also introduced for the first time a disk-aware algorithm for exact motif discovery for massive disk-resident datasets [11]. Although there has been significant research effort spent on efficiently discovering time series motifs, most of the literature has focused on fast and scalable approximate or exact algorithms for finding motifs in static offline databases.…”
Section: Related Workmentioning
confidence: 99%
“…Using these bounds, a superset of the k-NN answers can be returned, which will be then verified using the uncompressed sequences that will need to be fetched and compared with the query, so that the exact distances can be computed. Such filtering ideas are used in the majority of the data-mining literature for speeding up search operations [6,7,17].…”
Section: Searching Data Using Distance Estimatesmentioning
confidence: 99%