Data-driven classifications are improving statistical power and refining prognoses for a range of respiratory, infectious, autoimmune, and neurological diseases. Studies have used molecular information, age of disease incidence, and sequences of disease onset (“disease trajectories”). Here we consider whether easily measured risk factors such as height and BMI can usefully characterise diseases in UK Biobank data, combining established statistical methods in new but rigorous ways to provide clinically relevant comparisons and clusters of disease. Over 400 common diseases were selected for study on the basis of clinical and epidemiological criteria, and a conventional proportional hazards model was used to estimate associations with 12 established risk factors. Comparing men and women, several diseases had strongly sex-dependent associations of disease risk with BMI. Despite this, a large proportion of diseases affecting both sexes could be identified by their risk factors, and equivalent diseases tended to cluster adjacently. This included 10 diseases presently classified as “Symptoms, signs, and abnormal clinical and laboratory findings, not elsewhere classified”. Many clusters are associated with a shared, known pathogenesis, others suggest likely but presently unconfirmed causes. The specificity of associations and shared pathogenesis of many clustered diseases, provide a new perspective on the interactions between biological pathways, risk factors, and patterns of disease such as multimorbidity.