1994
DOI: 10.1002/j.2333-8504.1994.tb01609.x
|View full text |Cite
|
Sign up to set email alerts
|

Variable Screening for Cluster Analysis

Abstract: Inclusion of irrelevant variables in a cluster analysis adversely affects subgroup recovery. This paper examines using moment-based statistics to screen variables; only variables which pass the screening are then used in clustering. Normal mixtures are analytically shown often to possess negative kurtosis. Two related measures, m and coefficient of bimodality b, are also 'examined.A Monte Carlo study compared the screening measures to no selection, De Soete's (1988) ultrametric weights, and Fowlkes, Gnanadesik… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
6
0

Year Published

1994
1994
2019
2019

Publication Types

Select...
4

Relationship

2
2

Authors

Journals

citations
Cited by 4 publications
(6 citation statements)
references
References 40 publications
0
6
0
Order By: Relevance
“…Johnson (1967) and Milligan (1979) demonstrated the relationship between the ultrametric inequality and many commonly used hierarchical clustering algorithms, and Milligan and Isaac (1980) give simulation results which provide support for the utility of the conceptualization. Two studies (Milligan, 1989b, Donoghue, 1994a have examined De Soete's algorithm, and found that it greatly improved cluster recovery when the data contained one, two, or three irrelevant dimensions. Further studies to evaluate this method would clearly be useful.…”
Section: Alternative Proceduresmentioning
confidence: 99%
“…Johnson (1967) and Milligan (1979) demonstrated the relationship between the ultrametric inequality and many commonly used hierarchical clustering algorithms, and Milligan and Isaac (1980) give simulation results which provide support for the utility of the conceptualization. Two studies (Milligan, 1989b, Donoghue, 1994a have examined De Soete's algorithm, and found that it greatly improved cluster recovery when the data contained one, two, or three irrelevant dimensions. Further studies to evaluate this method would clearly be useful.…”
Section: Alternative Proceduresmentioning
confidence: 99%
“…Experts advise that, in performing cluster analysis, ''variables should only be included if there is good reason to think that they will define the clusters'' (Everitt, Landau, & Leese, 2001, p. 179). Variables that are not theoretically relevant should be omitted, because ''irrelevant variables are likely to 'swamp' the genuine differences of interest'' (Gordon, 1981, p. 30; see also Donoghue, 1994). If Veale et al sought to partition their participants into autogynephilic and nonautogynephilic clusters, they should have conducted an analysis using only Blanchard's Core Autogynephilia and Autogynephilic Interpersonal Fantasy scales, and omitted the irrelevant variables Attraction to Feminine Males and Attraction to Transgender Fiction.…”
mentioning
confidence: 99%
“…The relationship of the weighting methods identified here to the closely allied problem of variable selection (Milligan, 1989b;Donoghue, 1994a) remains to be addressed. Both of these papers found that the ultrametric weights were useful in ameliorating the degradation of cluster recovery caused by including irrelevant variables in the cluster analysis.…”
mentioning
confidence: 99%
“…Johnson (1967) and Milligan (1979) demonstrated the relationship between the ultrametric inequality and many commonly used hierarchical clustering algorithms, and Milligan and Isaac (1980) give simulation results which provide support for the utility of the conceptualization. Two studies (Milligan, 1989b, Donoghue, 1994a have examined De Soete's algorithm, and found that it greatly improved cluster recovery when the data contained "error" dimensions, i.e, dimensions which contained no information about subgroup membership. However, it is not known whether this method helps with the difficulties of within-group covariance structure.…”
mentioning
confidence: 99%
See 1 more Smart Citation