2007
DOI: 10.1007/s11634-007-0009-9
|View full text |Cite
|
Sign up to set email alerts
|

Variable selection in discriminant analysis based on the location model for mixed variables

Abstract: Non-parametric smoothing of the location model is a potential basis for discriminating between groups of objects using mixtures of continuous and categorical variables simultaneously. However, it may lead to unreliable estimates of parameters when too many variables are involved. This paper proposes a method for performing variable selection on the basis of distance between groups as measured by smoothed Kullback-Leibler divergence. Searching strategies using forward, backward and stepwise selections are outli… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
18
0
2

Year Published

2011
2011
2018
2018

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 15 publications
(20 citation statements)
references
References 29 publications
0
18
0
2
Order By: Relevance
“…The simulation results clearly show their impact on the performance, and we propose a method based on leave-one-out cross validation for obtaining optimal results. When using this appoach, the obtained results show that the proposal is competitive with that of Mahat et al (2007). All the proofs are given in Section 6.…”
mentioning
confidence: 86%
See 1 more Smart Citation
“…The simulation results clearly show their impact on the performance, and we propose a method based on leave-one-out cross validation for obtaining optimal results. When using this appoach, the obtained results show that the proposal is competitive with that of Mahat et al (2007). All the proofs are given in Section 6.…”
mentioning
confidence: 86%
“…To overcome these problems, we propose in this section a non-parametric smoothing procedure for estimating the criterion (5). This procedure is based on smoothing as it is done in Aspakourov and Krzanowki (2000) and Mahat et al (2007). Denote by D the dissimilarity defined on {1, · · · , M } 2 by…”
Section: Non-parametric Smoothing Proceduresmentioning
confidence: 99%
“…This full breast cancer data contains 15 variables comprising of 2 continuous variables, 4 nominal variables with three states each, 6 ordinal variables with eleven states each and 3 binary variables. All the ordinal variables are treated as continuous variables while all the nominal variables are transformed into binary variables according to the past studies (Hamid 2014;Krzanowski 1975;Mahat et al 2007). This pre-processing gives a new dimension with eight continuous and eleven binary variables of a full breast cancer data.…”
Section: Simulation and Real Datasetsmentioning
confidence: 99%
“…In particular, the continuous variables are utilized to estimate a set of parameters required in each multinomial cells created by the categorical variables. Based on the natural structure of cLM, all categorical variables need to be converted into binary structure in order to create segmentation called multinomial cells (Krzanowski 1993(Krzanowski , 1983(Krzanowski , 1975Mahat et al 2007). These multinomial cells play an important role in discriminating a new observation into group correctly (Asparoukhov & Krzanowski 2000;Krzanowski 1995).…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation