Proceedings 41st Annual Symposium on Foundations of Computer Science
DOI: 10.1109/sfcs.2000.892113
|View full text |Cite
|
Sign up to set email alerts
|

Testing that distributions are close

Abstract: Given two distributions over an n element set, we wish to check whether these distributions are statistically close by only sampling. We give a sublinear algorithm which uses O(n 2/3 −4 log n) independent samples from each distribution, runs in time linear in the sample size, makes no assumptions about the structure of the distributions, and distinguishes the cases when the distance between the distributions is small (less than max(We also give an Ω(n 2/3 −2/3 ) lower bound. Our algorithm has applications to t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

5
235
0

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 169 publications
(240 citation statements)
references
References 29 publications
5
235
0
Order By: Relevance
“…In recent years, a number of algorithms for these testing problems have been designed which require a number of samples that is only sublinear in the size of the domain, while making no assumptions on the form of the distribution. For example, on arbitrary domains of size N , testing whether a distribution is close to uniform in statistical distance can be performed with onlyÕ( √ N ) samples [7,2], and distinguishing whether two distributions are the same or far in statistical distance can be performed withÕ(N 2/3 ) samples [4]. Similar results have been obtained for testing whether a joint distribution is independent and estimating the entropy [2,3].…”
Section: Introductionsupporting
confidence: 54%
See 3 more Smart Citations
“…In recent years, a number of algorithms for these testing problems have been designed which require a number of samples that is only sublinear in the size of the domain, while making no assumptions on the form of the distribution. For example, on arbitrary domains of size N , testing whether a distribution is close to uniform in statistical distance can be performed with onlyÕ( √ N ) samples [7,2], and distinguishing whether two distributions are the same or far in statistical distance can be performed withÕ(N 2/3 ) samples [4]. Similar results have been obtained for testing whether a joint distribution is independent and estimating the entropy [2,3].…”
Section: Introductionsupporting
confidence: 54%
“…Theorem 4 There is an efficient algorithm TestUniform (see Figure 1) which, given generator access to an unknown monotone distribution p over {−1, 1} n , makes O( n 2 log n ) draws and satisfies the following properties: (i) If p ≡ U then TestUniform outputs "uniform" with probability at least 4 5 ; (ii) If p − U 1 ≥ then TestUniform outputs "nonuniform" with probability at least 4 5 .…”
Section: A Uniformity Testing Algorithmmentioning
confidence: 99%
See 2 more Smart Citations
“…Some examples include testing uniformity [20,8], independence [7], monotonicity and being unimodal [9], estimating the support sizes [34] and testing a weaker notion than k-wise independence, namely, "almost k-wise independence" [1].…”
Section: Other Related Researchmentioning
confidence: 99%