Proceedings of the 19th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 2013
DOI: 10.1145/2487575.2487650
|View full text |Cite
|
Sign up to set email alerts
|

A general bootstrap performance diagnostic

Abstract: As datasets become larger, more complex, and more available to diverse groups of analysts, it would be quite useful to be able to automatically and generically assess the quality of estimates, much as we are able to automatically train and evaluate predictive models such as classifiers. However, despite the fundamental importance of estimator quality assessment in data analysis, this task has eluded highly automatic solutions. While the bootstrap provides perhaps the most promising step in this direction, its … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
30
0

Year Published

2014
2014
2023
2023

Publication Types

Select...
3
2
2

Relationship

1
6

Authors

Journals

citations
Cited by 17 publications
(30 citation statements)
references
References 19 publications
0
30
0
Order By: Relevance
“…We call such a procedure a diagnostic. We are going to use a diagnostic recently developed by Kleiner et al [22], but first let us provide some intuition for how a diagnostic should work. We consider first an impractical ideal diagnostic.…”
Section: Diagnosismentioning
confidence: 99%
See 3 more Smart Citations
“…We call such a procedure a diagnostic. We are going to use a diagnostic recently developed by Kleiner et al [22], but first let us provide some intuition for how a diagnostic should work. We consider first an impractical ideal diagnostic.…”
Section: Diagnosismentioning
confidence: 99%
“…Algorithm 1 is the diagnostic of Kleiner et al [22], specialized to query approximation. We provide it here for completeness.…”
Section: Appendix a Diagnostic Algorithmmentioning
confidence: 99%
See 2 more Smart Citations
“…This derivation is a manual process that is ad-hoc and often impractical for complex queries [34]. 1 To address this problem, a second approach, called bootstrap, has emerged as a more general method for routine estimation of errors [27,30,34]. Bootstrap [19] is essentially a Monte Carlo procedure, which for a given initial sample, (i) repeatedly forms simulated datasets by resampling tuples i.i.d.…”
Section: Introductionmentioning
confidence: 99%