2008
DOI: 10.1090/crmp/045/07
|View full text |Cite
|
Sign up to set email alerts
|

Bilevel model selection for support vector machines

Abstract: Abstract. The successful application of Support Vector Machines (SVMs), kernel methods and other statistical machine learning methods requires selection of model parameters based on estimates of the generalization error. This paper presents a novel approach to systematic model selection through bilevel optimization. We show how modelling tasks for widely used machine learning methods can be formulated as bilevel optimization problems and describe how the approach can address a broad range of tasks-among which … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
18
0

Year Published

2008
2008
2022
2022

Publication Types

Select...
4
2
1

Relationship

1
6

Authors

Journals

citations
Cited by 20 publications
(18 citation statements)
references
References 46 publications
0
18
0
Order By: Relevance
“…They are nonconvex and nonsmooth, but their number remains in the size of w and no new variables are introduced. This hopefully permits better scalability than MPEC bilevel attempts (Bennett et al 2006;Kunapuli et al 2008). A penalty approach is used to bring these constraints into the outer-level objective.…”
Section: Explicit Model Selection Methodsmentioning
confidence: 99%
See 3 more Smart Citations
“…They are nonconvex and nonsmooth, but their number remains in the size of w and no new variables are introduced. This hopefully permits better scalability than MPEC bilevel attempts (Bennett et al 2006;Kunapuli et al 2008). A penalty approach is used to bring these constraints into the outer-level objective.…”
Section: Explicit Model Selection Methodsmentioning
confidence: 99%
“…Model selection via T -fold cross-validation can be written compactly as single bilevel program (Bennett et al 2006;Kunapuli et al 2008). The inner-level problems minimize the regularized training error to determine the best function for the given hyperparameters for each fold.…”
Section: Bilevel Cross-validation Problemmentioning
confidence: 99%
See 2 more Smart Citations
“…A similar non-convex optimization problem has been tackled by Kunapuli et al (2007). The fundamental difference between their setting and ours is that they had a single bi-level optimization problem, while we have a family of such problems, parameterized by τ .…”
Section: E(l τ (Y − C)|x) = Quantile τ Of P (Y |X) (2)mentioning
confidence: 99%