IntroductionAlthough other methods of organising data have been used , unsupervised clustering has been widely employed in analysing vegetation data. Most such analyses have used hierarchical clustering methods; for example, the widespread Braun-Blanquet method (Westhoff and van der Maarel 1978) is formidably hierarchical in its approach. Whatever the a priori likelihood that vegetation falls neatly into the nested clusters demanded by such a model, it is surely more appropriate to test if a hierarchy does provide a better model of the data than alternatives, such as Galois lattices (Rodin et al. 1998) or Gaussian response curves (ter Braak and Prentice 1988). This, of course, requires that we have some means of measuring the quality of a model. The Minimum Message Length (MML) principle (Wallace and Dowe 2000) provides just such a measure; the shorter the message length, the higher the prior probability of the model. (2000) SNOB. This provides a general regionalisation using a nonhierarchical clustering. A variant of the program incorporates possible spatial correlations (Wallace 1998) and thereby encourages spatial contiguity of cluster members but this was not used. The program does not provide a segmentation (cf. Oliver et al. 1998) with crisp boundaries for clusters. Instead it employs a fuzzy assignment of things to clusters; such fuzziness is necessary to obtain consistent estimates of cluster parameters.Boulton and Wallace (1973a,b) presented a method (HSNOB) using MML estimation for hierarchical clustering. This means that we can actually make a comparison of non-hierarchic and hierarchic analyses based on the message lengths. In this paper we propose to first examine the concept of hierarchy, and then consider possible reasons why vegetation might have such structure. We shall then examine the application of HSNOB and compare the hierarchical and non-hierarchical solutions to determine
Hierarchical clusters of vegetation types
Abstract:In this paper, we examine possible sources of hierarchical (nested) structure in vegetation data. We then use the Minimum Message length principle to provide a rational means of comparing hierarchical and non-hierarchical clustering. The results indicate that, with the data used, a hierarchical solution was not as efficient as a nonhierarchical one. However, the hierarchical solution seems to provide a more comprehensible solution, separating first isolated types, probably caused from unusual contingent events, then subdividing the more diverse areas before finally subdividing the less diverse. By presenting this in 3 stages, the complexity of the non-hierarchical result is avoided. The result also suggests that a hierarchical analysis may be useful in determining 'homogeneous' areas.Abbreviatons: MML -Minimum Message Length; MUAP -Modifiable unit area problem.