2021
DOI: 10.1214/20-ejs1780
|View full text |Cite
|
Sign up to set email alerts
|

Finite-sample analysis of $M$-estimators using self-concordance

Abstract: The classical asymptotic theory for parametric M -estimators guarantees that, in the limit of infinite sample size, the excess risk has a chi-square type distribution, even in the misspecified case. We demonstrate how self-concordance of the loss allows to characterize the critical sample size sufficient to guarantee a chi-square type in-probability bound for the excess risk. Specifically, we consider two classes of losses: (i) self-concordant losses in the classical sense of Nesterov and Nemirovski, i.e., who… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(17 citation statements)
references
References 48 publications
0
17
0
Order By: Relevance
“…For example, the logistic loss function used in logistic regression is not strictly self-concordant, but it fits into a class of pseudo-self-concordant functions, which allows one to obtain similar properties and bounds as those obtained for selfconcordant functions [Bach et al, 2010]. This was also the case in Ostrovskii & Bach [2021] and Tran-Dinh et al [2015], in which more general properties of these pseudo-self-concordant functions were established. This was fully formalized in Sun & Tran-Dinh [2019], in which the concept of generalized-self concordant functions was introduced, along with key bounds, properties, and variants of Newton methods for the unconstrained setting which make use of this property.…”
Section: Related Workmentioning
confidence: 89%
See 1 more Smart Citation
“…For example, the logistic loss function used in logistic regression is not strictly self-concordant, but it fits into a class of pseudo-self-concordant functions, which allows one to obtain similar properties and bounds as those obtained for selfconcordant functions [Bach et al, 2010]. This was also the case in Ostrovskii & Bach [2021] and Tran-Dinh et al [2015], in which more general properties of these pseudo-self-concordant functions were established. This was fully formalized in Sun & Tran-Dinh [2019], in which the concept of generalized-self concordant functions was introduced, along with key bounds, properties, and variants of Newton methods for the unconstrained setting which make use of this property.…”
Section: Related Workmentioning
confidence: 89%
“…Self-concordant functions have received strong interest in recent years due to the attractive properties that they allow to prove for many statistical estimation settings [Marteau-Ferey et al, 2019, Ostrovskii & Bach, 2021. The original definition of self-concordance has been expanded and generalized since its inception, as many objective functions of interest have self-concordant-like properties without satisfying the strict definition of self-concordance.…”
Section: Related Workmentioning
confidence: 99%
“…Therefore, the expected KL ( 13) is infinite for any number of samples. Instead of taking the expectation, one might want to bound the risk in high probability without resorting to Markov inequality, as achieved by (Ostrovskii and Bach, 2021), but this is a difficult endeavor. These examples make a case for regularized estimators such as MAP, for which we may find upper bounds.…”
Section: Problems Formulationmentioning
confidence: 99%
“…Overcoming this limitation, Ostrovskii and Bach (2021) characterize the number of samples needed to be upper bounded by a quadratic with high-probability, for any parametric models with a self-concordant log-likelihood f . Anastasiou and Reinert (2017) obtains a similar flavor of result under other assumptions on the third derivative of f .…”
Section: Locally Quadratic Casementioning
confidence: 99%
See 1 more Smart Citation