Crop models are instrumental in simulating resource utilization in agriculture, yet their complexity necessitates extensive calibration, which can impact the accuracy of yield predictions. Machine learning shows promise for enhancing yield estimations but relies on vast amounts of training data. This study aims to improve the pakchoi yield prediction accuracy of simulation models. We developed a stacking ensemble learning model that integrates three base models—EU-Rotate_N, Random Forest Regression and Support Vector Regression—with a Multi-layer Perceptron as the meta-model for the pakchoi dry matter yield prediction. To enhance the training dataset and bolster machine learning performance, we employed the EU-Rotate_N model to simulate daily dry matter yields for unsampled data. The test results revealed that the stacking model outperformed each base model. The stacking model achieved an R² value of 0.834, which was approximately 0.1 higher than that of the EU-Rotate_N model. The RMSE and MAE were 0.283 t/ha and 0.196 t/ha, respectively, both approximately 0.6 t/ha lower than those of the EU-Rotate_N model. The performance of the stacking model, developed with the expanded dataset, showed a significant improvement over the model based on the original dataset.