The blood count is one of the most common tests used for health assessment. In elderly individuals, selection of a 'healthy' reference population for laboratory assessment is difficult due to the high prevalence of chronic morbidities, leading to uncertainty regarding appropriate reference intervals. In particular, age-specific lower haemoglobin reference limits to define anaemia are controversial. Here, we applied a data mining approach to a large dataset of 3 029 904 clinical routine samples to establish blood count reference intervals. We excluded samples from units/specialists with a high proportion of abnormal blood counts, samples from patients with an unknown or decreased estimated glomerular filtration rate, and samples with abnormal test results in selected other analytes. After sample exclusion, 566 775-572 060 samples from different individuals aged 20-100 years were available for analysis. We then used an established statistical algorithm to determine the distribution of physiological test results and calculated age-and sex-specific reference intervals. Our results show substantial trends with age in haematology analytes' reference intervals. Most notably, haemoglobin and red cell counts decline in men with advanced age, accompanied by increases in red cell volume in both sexes. These findings were confirmed in an independent dataset, and suggest an at least partly physiologic cause.The blood count is one of the most common tests for health assessment with diagnostic and therapeutic implications for a multitude of common to rare and minor to life-threatening conditions. However, despite its frequent clinical use and importance and near-universal availability on the one hand, and an ageing population throughout the world on the other hand, uncertainty regarding appropriate reference intervals for older individuals still exists. In particular, age-specific lower reference limits for haemoglobin, which define anaemia, are controversial, despite substantial consequences for individual patients and considerable public health impact.A major challenge when establishing reference intervals for individuals >50 years is the increasing proportion of chronic morbidities and medication with age, which leads to exclusion of these subjects from conventional reference interval studies (Adeli et al., 2015;R€ ohrig et al., 2018). Based on these restrictions, data mining of age-specific reference intervals using laboratory test results collected from routine patient care can be considered a viable complement to conventional reference interval studies (Haeckel et al., 2017;Jones et al., 2018). In contrast to the highly selected population of conventional approaches, a reference interval data mining approach uses exactly the 'real world' population to which the reference intervals are ultimately applied, while still excluding outliers. In addition, the necessity to obtain a patient history and perform a clinical examination restricts the number of individuals who can be recruited for a conventional reference sample, while...