2014
DOI: 10.1177/1471082x14553366
|View full text |Cite
|
Sign up to set email alerts
|

Selection and Fusion of Categorical Predictors with L0-Type Penalties

Abstract: In regression modelling, categorical covariates have to be coded. Depending on the number of categorical covariates and on the number of levels they have, the number of coefficients can become huge. To reduce the model complexity, coefficients of similar categories should be fused and coefficients of non-influential categories should be set to zero. To this end, Lasso-type penalties on the differences of coefficients are a standard approach. However, the clustering/selection performance of this approach is som… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
8
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
5
1
1

Relationship

1
6

Authors

Journals

citations
Cited by 12 publications
(8 citation statements)
references
References 22 publications
0
8
0
Order By: Relevance
“…While this shrinks the largest and smallest of the estimated coefficients together, the remaining coefficients lying in the open interval between these are unpenalised and so no grouping of the estimates is encouraged, as we observe in Figure 2; see also Oelker et al (2015) for a discussion of this issue in the context of ordinal variables.…”
Section: Background and Motivationmentioning
confidence: 91%
See 1 more Smart Citation
“…While this shrinks the largest and smallest of the estimated coefficients together, the remaining coefficients lying in the open interval between these are unpenalised and so no grouping of the estimates is encouraged, as we observe in Figure 2; see also Oelker et al (2015) for a discussion of this issue in the context of ordinal variables.…”
Section: Background and Motivationmentioning
confidence: 91%
“…the absolute value | jk + jl | by (| jk + jl | ) where ρ is a concave and non-decreasing penalty function (Ma & Huang, 2017;Oelker et al, 2015). However, this does not address the basic issue of a preference for groups of unequal sizes.…”
Section: Background and Motivationmentioning
confidence: 99%
“…Whilst this shrinks the largest and smallest of the estimated coefficients together, the remaining coefficients lying in the open interval between these are unpenalised and so no grouping of the estimates is encouraged, as we observe in Figure 1; see also Oelker et al [2015] for a discussion of this issue in the context of ordinal variables.…”
Section: Overview Of Our Contributionsmentioning
confidence: 94%
“…The implementation of robust Haebara linking in the sirt [36] package specifies a sequence of decreasing values of ε in the optimization, each using the previous solution as initial values (see [37] for a similar approach). It should be noted that alternative differentiable approximating function for the loss function ρ(x) = |x| p for values p nearby zero have been employed [38].…”
Section: Estimationmentioning
confidence: 99%