2013
DOI: 10.1007/978-3-642-40285-2_30
|View full text |Cite
|
Sign up to set email alerts
|

CoDS: A Representative Sampling Method for Relational Databases

Abstract: Abstract. Database sampling has become a popular approach to handle large amounts of data in a wide range of application areas such as data mining or approximate query evaluation. Using database samples is a potential solution when using the entire database is not cost-effective, and a balance between the accuracy of the results and the computational cost of the process applied on the large data set is preferred. Existing sampling approaches are either limited to specific application areas, to single table dat… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
9
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
2
1

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(9 citation statements)
references
References 19 publications
0
9
0
Order By: Relevance
“…The sizes of the tables range from 77 (District) to 1,056,320 tuples (Trans). The Financial database schema is depicted in [4]. The starting table identified by ReX is the District table.…”
Section: Discussionmentioning
confidence: 99%
See 4 more Smart Citations
“…The sizes of the tables range from 77 (District) to 1,056,320 tuples (Trans). The Financial database schema is depicted in [4]. The starting table identified by ReX is the District table.…”
Section: Discussionmentioning
confidence: 99%
“…Both ReX and UpSizeR aim to scale the distributions of the relationships between tables by s (i.e., through primary and foreign keys). In [4] we proposed a sampling method that aimed to scale the same distributions by a sampling factor. We use the average representativeness error metric defined in [4], replacing the sampling rate with the scaling rate.…”
Section: Discussionmentioning
confidence: 99%
See 3 more Smart Citations