Weighted Support Vector Machines with the SCAD Penalty

Jung, Kang-Mo

doi:10.5351/csam.2013.20.6.481

Cited by 5 publications

(5 citation statements)

References 16 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…where V is the weight for the th observation with the th class. A weighted SVM is proposed for the robustness of the SVM which is not sensitive to outlier or leverage points (see [13]). We consider the weight for each class as…”

Section: Svm For Unbalanced Casesmentioning

confidence: 99%

See 1 more Smart Citation

Support Vector Machines for Unbalanced Multicategory Classification

Jung

2015

Mathematical Problems in Engineering

Self Cite

View full text Add to dashboard Cite

Classification is a very important research topic and its applications are various, because data can be easily obtained in these days. Among many techniques of classification the support vector machine (SVM) is widely applied to bioinformatics or genetic analysis, because it gives sound theoretical background and its performance is superior to other methods. The SVM can be rewritten by a combination of the hinge loss function and the penalty function. The smoothly clipped absolute deviation penalty function satisfies desirably statistical properties. Since standard SVM techniques typically treat all classes equally, it is not well suited to unbalanced proportion data. We propose a robust method to treat unbalanced cases based on the weights of the class. Simulation and a numerical example show that the proposed method is effective to analyze unbalanced proportion data.

show abstract

Section: Svm For Unbalanced Casesmentioning

confidence: 99%

“…The within group errors can be calculated as the misclassification rate for the th class. Weight (13) gives much more weights on the minority class and the well-classified group got the less weight. The larger values of | (x )| in (13) represent well-classified observations.…”

Section: Svm For Unbalanced Casesmentioning

confidence: 99%

Support Vector Machines for Unbalanced Multicategory Classification

Jung

2015

Mathematical Problems in Engineering

Self Cite

View full text Add to dashboard Cite

show abstract

“…Some work has been done to extend the smoothly clipped absolute deviation (SCAD) penalty with weighted linear SVMs with special forms of such weights (see Jung (2013)), but, beyond that, there has not been any targeted investigation of such to our knowledge. These two reasons make these explorations vitally important.…”

Section: Introductionmentioning

confidence: 99%

“…However, it is worth noting that our setting differs from a simple classification format in two vital aspects: (a) although the treatment selection objective can be rewritten into a (weighted) classification problem (as shown in Section 2), it is still in essence a fundamentally different problem from classification, and feature selection techniques in SVMs have not been studied under this context, and (b) weighted SVM is a more complicated optimization problem than the standard SVM, where the constraint on each support vector varies according to the weight associated with it, and research into feature extraction under this setting has also been fairly limited till now. Some work has been done to extend the SCAD penalty with the weighted linear support vector machines with special forms of such weights (see Jung, 2013), but beyond that, there hasn't been any targeted investigation of such as per our knowledge. These two reasons make these explorations vitally important.…”

Section: Introductionmentioning

confidence: 99%

Selecting Biomarkers for Building Optimal Treatment Selection Rules by Using Kernel Machines

Dasgupta

Huang

2019

Journal of the Royal Statistical Society Series C: Applied Statistics

View full text Add to dashboard Cite

Summary Optimal biomarker combinations for treatment selection can be derived by minimizing the total burden to the population caused by the targeted disease and its treatment. However, when multiple biomarkers are present, including all in the model can be expensive and can hurt model performance. To remedy this, we consider feature selection in optimization by minimizing an extended total burden that additionally incorporates biomarker costs. Formulating it as a 0‐norm penalized weighted classification, we develop various procedures for estimating linear and non‐linear combinations. Through simulations and a real data example, we demonstrate the importance of incorporating feature selection and marker cost when deriving treatment selection rules.

show abstract

“…One alternative is the least absolute deviation (LAD) estimate. Jung (2011Jung ( , 2013 proposed robust estimators and outlier detection methods in regression models and support vector machine. There are several robust versions of LASSO.…”

Section: Introductionmentioning

confidence: 99%

Penalized rank regression estimator with the smoothly clipped absolute deviation function

Park

Jung

2017

CSAM

Self Cite

View full text Add to dashboard Cite

The least absolute shrinkage and selection operator (LASSO) has been a popular regression estimator with simultaneous variable selection. However, LASSO does not have the oracle property and its robust version is needed in the case of heavy-tailed errors or serious outliers. We propose a robust penalized regression estimator which provide a simultaneous variable selection and estimator. It is based on the rank regression and the nonconvex penalty function, the smoothly clipped absolute deviation (SCAD) function which has the oracle property. The proposed method combines the robustness of the rank regression and the oracle property of the SCAD penalty. We develop an efficient algorithm to compute the proposed estimator that includes a SCAD estimate based on the local linear approximation and the tuning parameter of the penalty function. Our estimate can be obtained by the least absolute deviation method. We used an optimal tuning parameter based on the Bayesian information criterion and the cross validation method. Numerical simulation shows that the proposed estimator is robust and effective to analyze contaminated data.

show abstract

Weighted Support Vector Machines with the SCAD Penalty

Cited by 5 publications

References 16 publications

Support Vector Machines for Unbalanced Multicategory Classification

Support Vector Machines for Unbalanced Multicategory Classification

Selecting Biomarkers for Building Optimal Treatment Selection Rules by Using Kernel Machines

Penalized rank regression estimator with the smoothly clipped absolute deviation function

Contact Info

Product

Resources

About