2019
DOI: 10.48550/arxiv.1907.03816
Preprint

The Power of Comparisons for Actively Learning Linear Classifiers

Abstract: In the world of big data, large but costly to label datasets dominate many fields. Active learning, a semi-supervised alternative to the standard PAC-learning model, was introduced to explore whether adaptive labeling could learn concepts with exponentially fewer labeled samples. While previous results show that active learning performs no better than its supervised alternative for important concept classes such as linear separators, we show that by adding weak distributional assumptions and allowing comparison …

Cited by 6 publications (24 citation statements) | References 13 publications (16 reference statements)
“…Our algorithms require the use of comparison queries, an addition which we show is necessary in many cases for active PAC and ARPU-learning. Along with recalling lower bounds from [6] which show comparisons are necessary for efficiently active learning non-homogeneous hyperplanes, we show that in the noiseless case it is impossible to ARPU-learn the uniform distribution over S^1 in a finite number of label queries. Further, even with the addition of a margin assumption we show the existence of simple distributions which require a number of label queries that is exponential in dimension.…”
Section: Introduction
confidence: 94%
“…By allowing the learner to ask more complicated questions of the oracle, such as comparing two points, Kane et al. [4] showed that non-homogeneous linear separators in two dimensions can be learned in exponentially fewer labeled samples than in the PAC case. Later, Kane, Lovett, and Moran [5] extended this to higher dimensions using a complicated set of queries, and Hopkins, Kane, and Lovett [6] did the same by assuming weak concentration and anti-concentration on the distribution, conditions once again satisfied by s-concave distributions.…”
Section: Introduction
confidence: 99%
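The comparison queries discussed in the statement above are easy to state concretely: alongside the usual label query sign(⟨w, x⟩ − b), the learner may ask which of two points has the larger signed margin under the unknown separator. The following is a minimal sketch, assuming a non-homogeneous linear separator f(x) = ⟨w, x⟩ − b; the class name `LinearOracle` and the variable names are illustrative, not taken from the cited papers.

```python
import numpy as np

class LinearOracle:
    """Toy oracle for a non-homogeneous linear separator f(x) = <w, x> - b (illustrative)."""

    def __init__(self, w, b):
        self.w = np.asarray(w, dtype=float)  # normal vector of the hyperplane
        self.b = float(b)                    # offset (the non-homogeneous part)

    def label(self, x):
        """Label query: which side of the hyperplane does x fall on?"""
        return int(np.sign(self.w @ np.asarray(x) - self.b))

    def compare(self, x, x_prime):
        """Comparison query: which of x, x' has the larger value of <w, .> - b?

        The offset b cancels, so a comparison on (x, x') behaves like a label
        query on the difference x - x' under the corresponding homogeneous separator.
        """
        return int(np.sign(self.w @ (np.asarray(x) - np.asarray(x_prime))))

# Tiny usage example in R^2.
oracle = LinearOracle(w=[1.0, 2.0], b=0.5)
print(oracle.label([1.0, 0.0]))               # 1: the point lies on the positive side
print(oracle.compare([1.0, 0.0], [0.0, 0.0]))  # 1: the first point has the larger margin
```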
“…This encompasses not only KLMZ's [3] notion of enriched queries, but also the original "Membership query" model of Angluin [19], who allowed the learner to query any point in the overall instance space X rather than just on the subsample S ⊂ X. This model is also particularly well-studied for halfspaces, where it is called the point-location problem [14,20,21,22,23,9], and was actually studied originally by Meyer auf der Heide [14] in the perfect learning model even before Angluin's introduction of active learning.…”
Section: Related Work
confidence: 99%
“…In this setting, the learner is given an arbitrary finite sample S ⊂ R^d, and must infer the labels under an adversarially chosen classifier. Variants of this model have been studied in the computational geometry [14,20,21,22,23,9], statistical learning theory [13], and clustering literatures [29,37] under various names. Formally, we say a class (X, H) is perfectly learnable with respect to a query set Q in q(n) expected queries if there exists an algorithm A such that for every n ∈ N, every sample S ⊂ X of size n, and every hypothesis h ∈ H, A correctly labels all of S with respect to h in at most q(n) queries in expectation over the internal randomness of the algorithm.…”
Section: Perfect Learning
confidence: 99%
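To make the quoted definition of perfect learning concrete, here is a minimal sketch for the toy class of one-dimensional threshold classifiers h_t(x) = sign(x − t): with label queries alone, binary search over the sorted sample infers every label using O(log n) queries, so q(n) = O(log n) in the sense above. This is an illustrative instance under assumed names (`perfect_learn_thresholds`, `label_query`), not a construction from the cited work.

```python
def perfect_learn_thresholds(sample, label_query):
    """Infer the label of every point in `sample` under an unknown threshold
    classifier h_t(x) = +1 iff x >= t, using O(log n) label queries."""
    pts = sorted(sample)
    lo, hi = 0, len(pts)               # invariant: first +1-labeled index lies in [lo, hi]
    while lo < hi:
        mid = (lo + hi) // 2
        if label_query(pts[mid]) < 0:  # pts[mid] is below the threshold
            lo = mid + 1
        else:
            hi = mid
    boundary = lo                      # index of the first point labeled +1 (or n if none)
    return {x: (1 if i >= boundary else -1) for i, x in enumerate(pts)}

# Usage: an adversary fixes an unknown threshold t; the learner recovers every label.
t = 0.37
labels = perfect_learn_thresholds([0.1, 0.9, 0.4, 0.2, 0.8],
                                  lambda x: 1 if x >= t else -1)
print(labels)  # {0.1: -1, 0.2: -1, 0.4: 1, 0.8: 1, 0.9: 1}
```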
“…Some of that work, particularly the work in online algorithms with a reject option, was targeted at non-stationary sequences of examples x. Even intervals are impossible to learn in online models, and in a supervised i.i.d. model Kivinen (1990) showed that exponentially many examples are required to learn rectangles under uniform distributions (as cited by Hopkins et al. (2019); Goldwasser et al. (2020)). Part of the challenge is that most definitions also require few test rejections, unlike PQ-learning's requirement of few rejections with respect to P.…”
Section: Related Work
confidence: 99%