2014 IEEE International Symposium on Information Theory 2014
DOI: 10.1109/isit.2014.6875425
|View full text |Cite
|
Sign up to set email alerts
|

Sublinear algorithms for outlier detection and generalized closeness testing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
23
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 16 publications
(24 citation statements)
references
References 17 publications
1
23
0
Order By: Relevance
“…Finally, we remark that the binary classification problem is closely related with the so-called two sample homogeneity testing problem [9, Sec. II-C] and the closeness testing problem [10], [11], [12] where given two i.i.d. generated sequences X N and Y n , one aims to determine whether the two sequences are generated according to the same distribution or not.…”
Section: B Main Resultsmentioning
confidence: 99%
“…Finally, we remark that the binary classification problem is closely related with the so-called two sample homogeneity testing problem [9, Sec. II-C] and the closeness testing problem [10], [11], [12] where given two i.i.d. generated sequences X N and Y n , one aims to determine whether the two sequences are generated according to the same distribution or not.…”
Section: B Main Resultsmentioning
confidence: 99%
“…Importantly, our tester straightforwardly extends to unequal-sized samples, giving the first optimal tester in this setting. Closeness testing with unequal sized samples was considered in [AJOS14] that gives sample upper and lower bounds with a polynomial gap between them. Our tester uses m 1 = Ω(max(n 2/3 /ǫ 4/3 , n 1/2 /ǫ 2 )) samples from one distribution and m 2 = O(max(nm −1/2 1 /ǫ 2 , √ n/ǫ 2 )) from the other.…”
Section: Our Contributionsmentioning
confidence: 99%
“…The probability estimates are accurate up to a standard deviation of 0.003. The results of Figure 2 indicate the accuracy of the approximation predicted by (42) assuming that µ is a uniform distribution on an alphabet of size 8. Clearly, the approximation is quite accurate in this regime.…”
Section: B Outlier Hypothesis Testingmentioning
confidence: 77%
“…The objective is to determine whether or not both strings are drawn from identical distributions in P(Z). Homogeneity testing is also closely related to the problem of closeness testing [40]- [42]. As before we work in the regime where m and n are linearly related as m = λn where λ is a known constant.…”
Section: Homogeneity Testingmentioning
confidence: 99%