Monthly extreme precipitation (EP) forecasts are of vital importance in water resources management and storage behind dams. Machine learning (ML) is extensively used for forecasting monthly EP, and improvements in model performance have been a popular issue. The innovation of this study is summarized as follows. First, a distance correlation-Pearson correlation (DC-PC) method was proposed to identify the complex nonlinear relationship between global sea surface temperature (SST) and EP and select key input factors from SST. Second, a random forest (RF) model was used for forecasting monthly EP, and the physical mechanism of EP was obtained based on the feature importance (FI) of RF and DC–PC relationship. The middle and lower reaches of the Yangtze River (MLYR) were selected as a case study, and monthly EP in summer (June, July and August) was forecasted. Furthermore, the physical mechanism between key predictors with a large proportion of FI and EP was investigated. Results showed that the proposed model had high accuracy and robustness, in which R2 in the test period was above 0.81, and RMSE as well as MAE were below 10 mm. Meanwhile, the key predictors in the high SST years could cause eastward extension of the South Asian High, westward extension of the Western Pacific Subtropical High, water vapor rising motion and an increase in the duration of atmospheric rivers exceeding 66 h, which lead to increasing EP in the MLYR. The results indicated that the DC–PC method could replace Pearson correlation for investigating the nonlinear relationship between SST and EP, as well as for selecting the factors. Further, the key predictors that account for a large proportion of FI can be used for explaining the physical mechanism of EP and directing forecasts.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.