The case for sampling on very large file systems

Goldberg, George; Harnik, Danny; Sotnikov, Dmitriy V.

doi:10.1109/msst.2014.6855542

Search citation statements

Order By: Relevance

Paper Sections

Select...

Introduction1

Citation Types

Supporting

Mentioning

Contrasting

Year Published

2015

Publication Types

Select...

Book1

Relationship

Self Cite0

Independent1

Authors

Journals

Cited by 1 publication

(1 citation statement)

References 22 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…This algorithm, too, is efficient and can be applied to data streams. Random sampling is still an active research field and new sampling schemes are studied in various contexts; some indicative examples are sampling from sliding windows [13], from distributed data streams [4,15,5], from streams with time decay [6], independent range sampling [10], sampling on very large file systems [9], and stratified reservoir sampling [2]. In light of the above results (which are mainly from the data streams field), we consider the algorithms of [3] and [8] as fundamental sampling schemes for general purpose weighted random sampling over data streams.…”

Section: Introductionmentioning

confidence: 99%