2021
DOI: 10.48550/arxiv.2110.15824
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Tractability from overparametrization: The example of the negative perceptron

Abstract: In the negative perceptron problem we are given n data points (xi, yi), where xi is a d-dimensional vector and yi ∈ {+1, −1} is a binary label. The data are not linearly separable and hence we content ourselves to find a linear classifier with the largest possible negative margin. In other words, we want to find a unit norm vector θ that maximizes min i≤n yi θ, xi . This is a non-convex optimization problem (it is equivalent to finding a maximum norm vector in a polytope), and we study its typical properties u… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
6
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
1

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(6 citation statements)
references
References 54 publications
0
6
0
Order By: Relevance
“…to soft sphere packing), studies of the spherical perceptron in physics and mathematics are numerous. Without aiming at being exhaustive, and rather primarily referring to works relevant for our presentation, these studies include [GD88, Gar88, FPS + 17] in the physics literature, while the spherical perceptron has also been studied with mathematically rigorous techniques [ST02], [Tal10, Chapter 3], [Tal11, Chapter 8], [Sto13a,Sto13b,MZZ21]. In particular, the satisfiability threshold α = 2 has been rigorously determined.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…to soft sphere packing), studies of the spherical perceptron in physics and mathematics are numerous. Without aiming at being exhaustive, and rather primarily referring to works relevant for our presentation, these studies include [GD88, Gar88, FPS + 17] in the physics literature, while the spherical perceptron has also been studied with mathematically rigorous techniques [ST02], [Tal10, Chapter 3], [Tal11, Chapter 8], [Sto13a,Sto13b,MZZ21]. In particular, the satisfiability threshold α = 2 has been rigorously determined.…”
Section: Related Workmentioning
confidence: 99%
“…In this model, even computing the Gardner capacity conjecturally requires the full-RSB prediction. However, [Sto13b,MZZ21] have made a refined use of Gordon's inequality to improve over the replica-symmetric upper bound for the capacity. While similar ideas might be able to improve the upper bound of Theorem 1.8, it is not immediate to implement them, since the method used in [MZZ21] relies on the min-max problem being formulated over unit-norm vectors, which is not the case here.…”
Section: Stability Of the Inverse -While The Injectivity Question Is ...mentioning
confidence: 99%
See 1 more Smart Citation
“…In fact, the capacity of generalized linear classi ers to t random labels for the simple case of Gaussian generated data, is also a classical topic, e.g. [16][17][18][19][20][21][22][23][24].…”
Section: Introductionmentioning
confidence: 99%
“…As mentioned earlier, the perceptron model was analyzed in the (nonrigorous) statistical physics literature in the 1980s [Gar88,GD88,KM89,Méz89]. We refer to [Cov65,Gar88,MRSY19,MZZ21] for discussions of statistical motivations for the model.…”
mentioning
confidence: 99%