Due to the significant negative consequences of winter cold extremes, there is need to better understand and simulate the mechanisms driving their occurrence. The impact of atmospheric blocking on winter cold spells over North America is investigated using ERA-Interim and NCEP-DOE-R2 reanalyses for 1981–2010. Initial-condition large-ensembles of two generations of Canadian Earth System Models (CanESM5 and its predecessor, CanESM2) are evaluated in terms of their ability to represent the blocking-cold spell linkage and the associated internal-variability. The reanalysis datasets show that 72 and 58% of cold spells in southern and northern North America coincide with blocking occurring in the high-latitude Pacific-North America. Compared to the two reanalyses, CanESM2 and CanESM5 ensembles underestimate by 19.9 and 14.3% cold spell events coincident with blocking, due to significant under-representation of blocking frequency over the North Pacific (− 47.1 and − 29.0%), whereas biases in cold spell frequency are relatively small (6.6 and − 4.7%). In the reanalyses, regions with statistically significant above-normal cold spell frequency relative to climatology lie on the east and/or south flanks of blocking events, whereas those with below-normal frequency lie along the core or surrounding the blocking. The two ensembles reproduce the observed blocking-cold spell linkage over North America, despite underestimating the magnitude of blocking frequency. The two ensembles also reproduce the physical drivers that underpin the blocking-cold spell linkage. Spatial agreement with the reanalyses is found in the simulated patterns of temperature advection and surface heat flux forcing anomalies during blocking events. While CanESM5 shows an improved representation of the blocking climatology relative to CanESM2, both yield similar results in terms of the blocking-cold spell linkage and associated internal-variability.