BackgroundLinkage of demographic, health, and developmental administrative data can enrich population-based surveillance and research on developmental and educational outcomes. Transparency of the record linkage process and results are required to assess potential biases.
ObjectivesTo describe the approach used to link records of kindergarten children from the Early Development Instrument (EDI) in Ontario to health administrative data and test differences in characteristics of children by linkage status. We demonstrate how socio-demographic and medical risk factors amass in their contribution to early developmental vulnerability and test the concordance of health diagnoses in both the EDI and health datasets of linked records.
MethodsChildren with records in the 2015 EDI cycle were deterministically linked to a population registry in Ontario, Canada. We compared sociodemographic and developmental vulnerability data between linked and unlinked records. Among linked records, we examined the contribution of medical and social risk factors obtained from health administrative data to developmental vulnerability identified in the EDI using descriptive analyses.
ResultsOf 135,937 EDI records, 106,217 (78.1%) linked deterministically to a child in the Ontario health registry using birth date, sex, and postal code. The linked cohort was representative of children who completed the EDI in age, sex, rural residence, immigrant status, language, and special needs status. Linked data underestimated children living in the lowest neighbourhood income quintile (standardized difference [SD] 0.10) and with higher vulnerability in physical health and well-being (SD 0.11) , social competence (SD 0.10), and language and cognitive development (SD 0.12). Analysis of linked records showed developmental vulnerability is sometimes greater in children with social risk factors compared to those with medical risk factors. Common childhood conditions with records in health data were infrequently recorded in EDI records.
ConclusionsLinkage of early developmental and health administrative data, in the absence of a single unique identifier, can be successful with few systematic biases introduced. Cross-sectoral linkages can highlight the relative contribution of medical and social risk factors to developmental vulnerability and poor school achievement.