Predicting water levels in urban storm-water sewer systems is a critical study that could provide vital information to help reduce the risk of flooding. This study proposed a new ensemble model based on the integration of a meta-learner model, residual-error corrections, and a multiple-output framework. To achieve the meta-learner model, three multiple-output data-driven-based (MOD) sewer flooding models employing support vector regression (SVR), k-nearest neighbor regression (KNR), and categorical gradient boosting regression (CGBR) techniques were constructed and applied to predict the short-duration evolution of water levels at seven storm-water gauging sites in Taipei city, Taiwan, considering 10-min datasets spanning nearly 6 years (2016–2021). The Bayesian optimization algorithm was utilized in the training phases for all the models to avoid overfitting or underfitting. Enhancing the analysis of feature importance was also conducted to explore model interpretability based on the SHapley Additive exPlanation (SHAP) algorithm. The outputs of storm-water management model (SWMM) were used as benchmark solutions. For the model validation phase, the proposed integrated model improved the lead-time-averaged Nash–Sutcliffe efficiency of single KNR, SVR, and CGBR models by 174.5, 42.4, and 69.4%, respectively, showing that the proposed accurate model could be useful for urban flood warning systems.