Objective: To explore the utility of cluster analysis in de®ning complex dietary exposures, separately with two types of variables. Design: A modi®ed diet history method, combining a 7-day menu book and a 168-item questionnaire, assessed dietary habits. A standardized questionnaire collected information on sociodemographics, lifestyle and health history. Anthropometric information was obtained through direct measurements. The dietary information was collapsed into 43 generic food groups, and converted into variables indicating the per cent contribution of speci®c food groups to total energy intake. Food patterns were identi®ed by the QUICK CLUSTER procedure in SPSS, in two separate analytical steps using unstandardized and standardized (Z-scores) clustering variables. Setting: The Malmo È Diet and Cancer (MDC) Study, a prospective study in the third largest city of Sweden, with baseline examinations from March 1991 to October 1996. Subjects: A random sample of 2206 men and 3151 women from the MDC cohort (n = 28 098). Results: Both variable types produced conceptually well separated clusters, con®rmed with discriminant analysis.`Healthy' and`less healthy' food patterns were also identi®ed with both types of variables. However, nutrient intake differences across clusters were greater, and the distribution of the number of individuals more even, with the unstandardized variables. Logistic regression indicated higher risks of past food habit change, underreporting of energy and higher body mass index (BMI) for individuals falling into`healthy' food pattern clusters. Conclusions: The utility in discriminating dietary exposures appears greater for unstandardized food group variables. Future studies on diet and cancer need to recognize the confounding factors associated with`healthy' food patterns.
Keywords
Dietary patternsFood patterns Dietary exposure categories Nutrient density Standardization Z-scores Diet history EpidemiologyFoods are consumed in a number of combinations, providing a range of nutrients and other dietary factors, which interact in very complex ways. Credible hypotheses have been formulated linking diet to cancer, either through promotion of, or protection against, cancer development 1±4 . However, nutrition studies have dif®-culties in separating the effects of individual nutrients, because food sources often are the same for many nutrients (e.g. for energy and fat, for different types of fat and for plant food constituents) resulting in highly correlated variables 3±5 . Strong correlations between variables (multicolinearity) result in unstable relationships, and may cause attenuation of diet±disease relations if the variables are entered simultaneously into the same model. Reviewers have pointed out that it is dif®cult both on practical and theoretical grounds to isolate the speci®c cancer-related biological activities of single nutrients or chemicals in observational epidemiology studies, and that integrative approaches taking into account multiple dietary factors are needed 3,4 . Multiv...