2007 IEEE 23rd International Conference on Data Engineering 2007
DOI: 10.1109/icde.2007.369004
|View full text |Cite
|
Sign up to set email alerts
|

Summarizing Order Statistics over Data Streams with Duplicates

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
7
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
4
1

Relationship

1
4

Authors

Journals

citations
Cited by 6 publications
(7 citation statements)
references
References 23 publications
0
7
0
Order By: Relevance
“…Table V shows candidate limits of probabilistic k-skybands for typical values of parameters k and window sizes n when σ = 0.001. We can see that the number of candidates is much smaller than window size n, and decreases significantly compared to n for large values of n. For small values of parameter k, the number of candidates is larger in the probabilistic k-skyband than in the k-skyband since the expected size of the latter is k[ln( n k ) − 1] for random-order data streams, as shown in Zhang [2008]. 4.1.2.…”
Section: Theorem 44 Let Q Be a Top-k/w Query Over A Count-based Winmentioning
confidence: 91%
“…Table V shows candidate limits of probabilistic k-skybands for typical values of parameters k and window sizes n when σ = 0.001. We can see that the number of candidates is much smaller than window size n, and decreases significantly compared to n for large values of n. For small values of parameter k, the number of candidates is larger in the probabilistic k-skyband than in the k-skyband since the expected size of the latter is k[ln( n k ) − 1] for random-order data streams, as shown in Zhang [2008]. 4.1.2.…”
Section: Theorem 44 Let Q Be a Top-k/w Query Over A Count-based Winmentioning
confidence: 91%
“…Existing solution on top-k/w processing ( [29,8,9,6,30,10,31,7,20,19,21]) assume centralized processing at a single network node and thus differ significantly from the distributed top-k/w processing approach we present in this paper. They can be classified in two categories: deterministic approaches ( [8,9,6,30,20,19,21]) which produce correct results to defined queries, and probabilistic approaches ( [29,7,19]) which generate errors and thus produce approximate results, but are in general more efficient and require less memory than the deterministic approaches.…”
Section: Data Stream Processing Systemsmentioning
confidence: 95%
“…Furthermore, as a top-k/w query continuously identifies k best-ranked data objects in the query window with respect to an arbitrary scoring function, we can additionally classify existing algorithms according to the type of supported scoring functions. Examples are distance [29,9,6], aggregation [8,30,31] and relevance [20,21] scoring functions.…”
Section: Data Stream Processing Systemsmentioning
confidence: 99%
“…These works can be classified in two categories: deterministic approaches [4,9,11,[18][19][20]26] which produce correct results to defined queries, and probabilistic approaches [13,14,26] which generate errors and thus produce approximate results, but are in general more efficient and require less memory than the deterministic approaches. Furthermore, as a top-k/w query continuously identifies k best-ranked data objects in the query window with respect to an arbitrary scoring function, we can additionally classify these works according to whether distance [4,14,20], aggregation [9,18,33] or relevance [11,19] scoring function is assumed. Following this categorization, k-NN/w queries are topk/w queries with distance scoring functions.…”
Section: Related Workmentioning
confidence: 99%