“…In detail, the CNN-based model is derived via the neutral network, comprising the convolution, pooling, and fully connected layers, requiring extensive 2D data (i.e., gridded data) as datasets (e.g., the image and videos) for the model training and application [3,4,15]; accordingly, the CNN model can efficiently provide the single model output from the grid-format model inputs. That is to say, the CNN-based model is advantageous concerning the 2D flood simulations for predicting or estimating the spatiotemporal flood-related variates (e.g., the inundation depths and corresponding area) [9]. The CNN-derived models have been proven to be efficiently implemented in the 2D flood forecast with the gridded inputs (such as the grid-based radar precipitation); however, without the mixed data, including the single dataset (e.g., at-site water level) and grid-format data under high-performance computing servers [3], it might cause the extensive computation time in the model training and application.…”