2012
DOI: 10.1111/j.1467-9868.2011.01014.x
|View full text |Cite
|
Sign up to set email alerts
|

Fast Subset Scan for Spatial Pattern Detection

Abstract: Summary. We propose a new 'fast subset scan' approach for accurate and computationally efficient event detection in massive data sets. We treat event detection as a search over subsets of data records, finding the subset which maximizes some score function. We prove that many commonly used functions (e.g. Kulldorff's spatial scan statistic and extensions) satisfy the 'linear time subset scanning' property, enabling exact and efficient optimization over subsets. In the spatial setting, we demonstrate that proxi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
160
0

Year Published

2014
2014
2024
2024

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 176 publications
(160 citation statements)
references
References 27 publications
0
160
0
Order By: Relevance
“…where U (S, αmax) refers to the union of {αmax} and the set of distinct p-values less than αmax in S. Because the BJ scan statistic satisfies the linear time subset scanning (LTSS) property [16], the subproblem…”
Section: Efficient Non-parametric Scanningmentioning
confidence: 99%
“…where U (S, αmax) refers to the union of {αmax} and the set of distinct p-values less than αmax in S. Because the BJ scan statistic satisfies the linear time subset scanning (LTSS) property [16], the subproblem…”
Section: Efficient Non-parametric Scanningmentioning
confidence: 99%
“…The main idea is to evaluate all subsets of the data for possible events. Of course, enumerating all possible O (2 n ) subsets is infeasible for even moderately sized problems, so later variations impose constraints on space and time to reduce the number of subsets, which is often done by scanning only subsets of distinct sizes and shapes [85,86]. This approach reduces the complexity to O n 2 (and to O (n) in [86]) but may also impair the detection performance if important signals are not well captured by tested subsets.…”
Section: Scan Statisticsmentioning
confidence: 99%
“…Linear Time Subset Scanning was recently developed to further reduce the complexity [86]. It states that for any scoring functions G, e.g., the ratio of count to baseline G(…”
Section: Scan Statisticsmentioning
confidence: 99%
See 2 more Smart Citations