Large‐scale day‐ahead wind power forecasting (WPF) for wind farm clusters (WFCs) can enable dispatching agencies to formulate scientifically sound power generation plans and enhance the robustness of power grids. Most available WPF methods for WFCs only involve mathematical models and rarely consider spatial correlation factors. This necessitates further improvements to forecasting systems. In this study, to increase the day‐ahead WPF accuracy for WFCs, fractal transform theory is introduced to optimize the process of WPF for WFCs through spatial upscaling and establish a day‐ahead WPF model for WFCs based on an improved spatial upscaling method. First, a WFC is partitioned into subclusters. Then, using fractal transform theory, an affine relation is established between the local output of each subcluster and the overall output of the WFC. Finally, a day‐ahead power forecast is produced for the WFC through spatial upscaling of the forecast for the subcluster with the highest grey relational grade with the output of the WFC. The applicability of the proposed forecasting model is examined using historical measured data for a large WFC in north‐eastern China. The case study results show that the proposed forecasting model outperforms the summation method and the statistical upscaling method in terms of forecasting accuracy.