2016
DOI: 10.1007/978-3-319-47099-3_11
|View full text |Cite
|
Sign up to set email alerts
|

DSS: A Scalable and Efficient Stratified Sampling Algorithm for Large-Scale Datasets

Abstract: Statistical analysis of aggregated records is widely used in various domains such as market research, sociological investigation and network analysis, etc. Stratified sampling (SS), which samples the population divided into distinct groups separately, is preferred in the practice for its high effectiveness and accuracy. In this paper, we propose a scalable and efficient algorithm named DSS, for SS to process large datasets. DSS executes all the sampling operations in parallel by calculating the exact subsample… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
1
1

Relationship

1
1

Authors

Journals

citations
Cited by 2 publications
(2 citation statements)
references
References 15 publications
0
2
0
Order By: Relevance
“…Stratified sampling is a sampling method involving the division of a population into distinct groups known as strata [21]. These strata are homogeneous subgroups of the original data with similar inner items.…”
Section: Stratified Samplingmentioning
confidence: 99%
See 1 more Smart Citation
“…Stratified sampling is a sampling method involving the division of a population into distinct groups known as strata [21]. These strata are homogeneous subgroups of the original data with similar inner items.…”
Section: Stratified Samplingmentioning
confidence: 99%
“…Therefore, maintaining a sampling distribution according to this influence during hard example mining should enhance the performance of object detectors. S-OHEM exploits stratified sampling, a sampling method involving the division of a population into distinct groups known as strata [21] (homogeneous subgroups, in which the inner items are similar to each other). During each mini-batch iteration, S-OHEM firstly assigns candidate examples (in the form of Region of Interests, RoIs) to different strata by the ratio between classification and localization loss.…”
Section: Introductionmentioning
confidence: 99%