2021
DOI: 10.48550/arxiv.2101.05657
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

No-go Theorem for Acceleration in the Hyperbolic Plane

Abstract: In recent years there has been significant effort to adapt the key tools and ideas in convex optimization to the Riemannian setting. One key challenge has remained: Is there a Nesterovlike accelerated gradient method for geodesically convex functions on a Riemannian manifold? Recent work has given partial answers and the hope was that this ought to be possible.Here we dash these hopes. We prove that in a noisy setting, there is no analogue of accelerated gradient descent for geodesically convex functions on th… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(11 citation statements)
references
References 21 publications
0
11
0
Order By: Relevance
“…To explain the issue, let d(A, v, r) be the point r units from the given point A in direction v. Suppose that, due to the numerical precision issues, we represent the direction v as v , where |v − v | ≤ . However, a circle of radius r has circumference of 2π sinh(r), which is exponential in r, and thus the distance between d(A, v, r) and d(A, v , r) can be of order × e r [11]. In general, the number of tiles in r steps from the center is exponential in r, so any representation using a fixed number of bits will not be able to discern between the coordinates of two tiles if r is large enough, on the order of the number of bits.…”
Section: Previous Methodsmentioning
confidence: 99%
“…To explain the issue, let d(A, v, r) be the point r units from the given point A in direction v. Suppose that, due to the numerical precision issues, we represent the direction v as v , where |v − v | ≤ . However, a circle of radius r has circumference of 2π sinh(r), which is exponential in r, and thus the distance between d(A, v, r) and d(A, v , r) can be of order × e r [11]. In general, the number of tiles in r steps from the center is exponential in r, so any representation using a fixed number of bits will not be able to discern between the coordinates of two tiles if r is large enough, on the order of the number of bits.…”
Section: Previous Methodsmentioning
confidence: 99%
“…Cutting plane methods, such as ellipsoid, require an exponential bound on the volume of a known region containing an approximate optimizer. This is the case for Rusciano's non-constructive query upper bound for cutting plane methods on manifolds of non-positive curvature [Rus20], which is essentially tight [HM21] . The volume of a ball in the manifold we consider grows exponentially in the radius (see Section 4.5), so this query bound will be exponential.…”
Section: Theorem 14 (Noncommutative Diameter Lower Bound)mentioning
confidence: 99%
“…When studying the global complexity of Riemannian optimization algorithms, it is common to assume that the sectional curvature of M is bounded below by K min and bounded above by K max to prevent the manifold from being overly curved. Unfortunately, [13,14] show that even when sectional curvature is bounded, achieving global acceleration is impossible in general. Thus, one might need another common assumption, an upper bound D of diam(N ).…”
Section: Introductionmentioning
confidence: 99%