Runoff forecasting is crucial for water resource management and flood safety and remains a central research topic in hydrology. Recent advancements in machine learning provide novel approaches for predicting runoff. This study employs the Competitive Adaptive Reweighted Sampling (CARS) algorithm to integrate various machine learning models into a data-driven rainfall–runoff simulation model. We compare the forecasting performance of different machine learning models to improve rainfall–runoff prediction accuracy. This study uses data from the Maduwang hydrological station in the Bahe river basin, which contain 12 measured flood events from 2000 to 2010. Historical runoff and areal mean rainfall serve as model inputs, while flood data at different lead times are used as model outputs. Among the 12 flood events, 9 are used as the training set, 2 as the validation set, and 1 as the testing set. The results indicate that the CARS-based machine learning model effectively forecasts floods in the Bahe River basin. Under the prediction period of 1 to 6 h, the model achieves high forecasting accuracy, with the average NSE ranging from 0.7509 to 0.9671 and the average R2 ranging from 0.8397 to 0.9413, though the accuracy declines to some extent as the lead time increases. The model accurately predicts peak flow and performs well in forecasting high flow and recession flows, though peak flows are somewhat underestimated for longer lead times. Compared to other machine learning models, the SVR model has the highest average RMSE of 0.942 for a 1–6 h prediction period. It exhibits the smallest deviation among low-, medium-, and high-flow curves, with the lowest NRMSE values across training, validation, and test sets, demonstrating better simulation performance and generalization capability. Therefore, the machine learning model based on CARS feature selection can serve as an effective method for flood forecasting. The related findings provide a new forecasting method and scientific decision-making basis for basin flood safety.