Classification model selection via bilevel programming

Kunapuli, Gautam; Bennett, Kristin P.; Hu, Jing; Pang, Jong-Shi

doi:10.1080/10556780802102586

Cited by 66 publications

(79 citation statements)

References 35 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Recent efforts [30][31][32] have focused on a bilevel optimization approach to implementing K-fold cross-validation. This approach may be beneficial to learn VISHID models by permitting faster SVM model Figure 11.…”

Section: Discussionmentioning

confidence: 99%

Modeling Choices for Virtual Screening Hit Identification

Bergeron

Krein

Moore

et al. 2011

Molecular Informatics

View full text Add to dashboard Cite

Making suitable modeling choices is crucial for successful in silico drug design, and one of the most important of these is the proper extraction and curation of data from qHTS screens, and the use of optimized statistical learning methods to obtain valid models. More specifically, we aim to learn the top-1 % most potent compounds against a variety of targets in a procedure we call virtual screening hit identification (VISHID). To do so, we exploit quantitative high-throughput screens (qHTS) obtained from PubChem, descriptors derived from molecular structures, and support vector machines (SVM) for model generation. Our results illustrate how an appreciation of subtle issues underlying qHTS data extraction and the resulting SVM models created using these data can enhance the effectiveness of solutions and, in doing so, accelerate drug discovery.

show abstract

Section: Discussionmentioning

confidence: 99%

Modeling Choices for Virtual Screening Hit Identification

Bergeron

Krein

Moore

et al. 2011

Molecular Informatics

View full text Add to dashboard Cite

show abstract

“…Many of the algorithms presented in this monograph have similar structures. It has already been demonstrated that bilevel machine learning problems perform fairly well with regards to generalization error [3,36] when solved using SQP-based methods such as filter. It should be noted that since the ultimate goal is to produce good generalization, the solutions found need not necessarily be highly accurate or global optimal.…”

Section: Discussionmentioning

confidence: 99%

“…In this section, we extend the parameter selection idea introduced in the previous section to support vector classification and show how the bilevel formulation can handle a large number of hyper-parameters; this is a review of work that was first introduced in [36]. The inner-level problem is the standard SVC model [12] augmented with additional feature selection constraints.…”

Section: Parameter Selection For Linear Sv Classificationmentioning

confidence: 99%

“…The scalars, b t , are collected into the vector b ∈ R T . Using this additional notation, the formulation of cross validation for SV classification as a bilevel program [36] is shown below.…”

Section: Parameter Selection For Linear Sv Classificationmentioning

confidence: 99%

“…Inexact cross validation was used to solve bilevel cross validation for support vector regression, (2.5), in [3] and support vector classification, (4.1), in [36] using filter. In spite of the fact that filter provides no guarantee of global optimality and generally converges to locally optimal solutions, this method performed well with regard to generalization error, indicating that local optimal solutions can be practically satisfactory.…”

Section: Definition 4 (S-stationarity)mentioning

confidence: 99%

See 2 more Smart Citations

Bilevel model selection for support vector machines

Kunapuli¹,

Bennett²,

Hu³

et al. 2008

CRM Proceedings and Lecture Notes

Self Cite

View full text Add to dashboard Cite

Abstract. The successful application of Support Vector Machines (SVMs), kernel methods and other statistical machine learning methods requires selection of model parameters based on estimates of the generalization error. This paper presents a novel approach to systematic model selection through bilevel optimization. We show how modelling tasks for widely used machine learning methods can be formulated as bilevel optimization problems and describe how the approach can address a broad range of tasks-among which are parameter, feature and kernel selection In addition, we also discuss the challenges in implementing these approaches and enumerate opportunities for future work in this emerging research area.

show abstract

Bilevel Optimization: Theory, Algorithms, Applications and a Bibliography

Dempe

2020

Springer Optimization and Its Applications

View full text Add to dashboard Cite

Classification model selection via bilevel programming

Cited by 66 publications

References 35 publications

Modeling Choices for Virtual Screening Hit Identification

Modeling Choices for Virtual Screening Hit Identification

Bilevel model selection for support vector machines

Bilevel Optimization: Theory, Algorithms, Applications and a Bibliography

Contact Info

Product

Resources

About