1990
DOI: 10.1007/3-540-52342-1_23
|View full text |Cite
|
Sign up to set email alerts
|

Random sampling from database files: A survey

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
52
0
2

Year Published

2003
2003
2024
2024

Publication Types

Select...
4
3
2

Relationship

0
9

Authors

Journals

citations
Cited by 61 publications
(54 citation statements)
references
References 13 publications
0
52
0
2
Order By: Relevance
“…Olken and Rotem give an excellent survey of work in this area [23]. However, most of this work is very different than ours, in that it is concerned primarily with sampling from an existing database file, where it is assumed that the data to be sampled from are all present on disk and indexed by the database.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…Olken and Rotem give an excellent survey of work in this area [23]. However, most of this work is very different than ours, in that it is concerned primarily with sampling from an existing database file, where it is assumed that the data to be sampled from are all present on disk and indexed by the database.…”
Section: Related Workmentioning
confidence: 99%
“…The most well-known papers in this area are due to Olken and Rotem [22][ 24], who also offer the definitive survey of related work through the early 1990's [23]. However, this work is relevant mostly for sampling from data stored in a database, and is not suitable for emerging applications such as stream-based data management.…”
Section: Introductionmentioning
confidence: 99%
“…In addition to the previous techniques, window query selectivity on nonuniform data can be estimated using fractals and power laws [Belussi and Faloutsos 1995;Faloutsos 1998, 2001], sampling [Olken and Rotem 1990;Palmer and Faloutsos 2000;Chaudhuri et al 2001;Wu et al 2001], kernel estimation [Blohsfeld et al 1999], single value decomposition [Poosala and Ioannidis 1997], compressed histograms [Matias et al 1998[Matias et al , 2000Lee et al Fig. 2.…”
Section: Selectivity and Nearest Distance In Spatial Databasesmentioning
confidence: 99%
“…Such information can be used for statistical analyses of databases, where approximate answers would suffice. It may also be used to estimate selectivities or intermediate result sizes for query optimization [11]. In the context of association rules, sampling can be utilized to gather quick preliminary rules.…”
Section: Introductionmentioning
confidence: 99%