“…This algorithm, too, is efficient and can be applied to data streams. Random sampling is still an active research field and new sampling schemes are studied in various contexts; some indicative examples are sampling from sliding windows [13], from distributed data streams [4,15,5], from streams with time decay [6], independent range sampling [10], sampling on very large file systems [9], and stratified reservoir sampling [2]. In light of the above results (which are mainly from the data streams field), we consider the algorithms of [3] and [8] as fundamental sampling schemes for general purpose weighted random sampling over data streams.…”