2020
DOI: 10.1029/2020wr027184

Bootstrap Aggregation and Cross‐Validation Methods to Reduce Overfitting in Reservoir Control Policy Search

Abstract: Policy search methods provide a heuristic mapping between observations and decisions and have been widely used in reservoir control studies. However, recent studies have observed a tendency for policy search methods to overfit to the hydrologic data used in training, particularly the sequence of flood and drought events. This technical note develops an extension of bootstrap aggregation (bagging) and cross‐validation techniques, inspired by the machine learning literature, to improve reservoir control policy p…
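To make the note's core idea concrete, below is a minimal, self-contained sketch of bagging applied to policy search. It is an illustration under stated assumptions, not the authors' implementation: `simulate_objective` is a toy stand-in for a reservoir simulation, and averaging parameters is only one plausible way to aggregate the ensemble.

```python
"""Minimal sketch of bagging for reservoir policy search. A policy is
assumed to be a parameter vector scored by a reservoir simulation;
`simulate_objective` is a placeholder, not the paper's model."""
import numpy as np
from scipy.optimize import differential_evolution

def simulate_objective(params, inflows):
    # Toy stand-in for a reservoir simulation: score a linear release
    # rule (release = a + b * inflow) against a constant target release.
    releases = params[0] + params[1] * inflows
    return float(np.mean((releases - np.mean(inflows)) ** 2))

def bagged_policy(inflows_by_year, n_members=10, seed=0):
    """Optimize one policy per bootstrap resample of hydrologic years,
    then aggregate the ensemble by averaging parameters."""
    rng = np.random.default_rng(seed)
    n_years = len(inflows_by_year)
    members = []
    for _ in range(n_members):
        # Resample whole years with replacement (the bootstrap step).
        resample = rng.choice(n_years, size=n_years, replace=True)
        series = np.concatenate([inflows_by_year[y] for y in resample])
        result = differential_evolution(
            simulate_objective,
            bounds=[(-1.0, 1.0), (0.0, 2.0)],
            args=(series,),
            maxiter=50,
            seed=int(rng.integers(1_000_000)),
        )
        members.append(result.x)
    return np.mean(members, axis=0)  # one plausible aggregation choice
```

For example, `bagged_policy([np.random.rand(365) for _ in range(30)])` would fit an ensemble over 30 synthetic years of daily inflows and return the averaged rule.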

Cited by 33 publications (16 citation statements). References 24 publications.

Citation statements, ordered by relevance:
“…This approach can overfit the policy parameters to the particular stochastic realizations experienced during the simulation‐based optimization, potentially yielding impressive calibration results that can largely degrade when tested on out‐of‐sample observations. This issue can be overcome by splitting the available observations (or an ensemble of synthetically generated records) into two statistically equivalent data sets, optimizing the policy parameters on the first set and testing policy performance on the second (Brodeur et al., 2020). Another often overlooked aspect of DPS is the a priori definition of the policy architecture, which is generally based on intuition, analytical methods, or a few trial‐and‐error experiments.…”
Section: Beyond Stochastic Dynamic Programming (mentioning; confidence: 99%)
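The split-sample procedure this statement describes can be sketched as follows; `optimize_policy` and `simulate_objective` are hypothetical interfaces, not code from the cited works.

```python
"""Sketch of a split-sample test: calibrate a policy on one half of the
record and score it out-of-sample on the other half."""
import numpy as np

def split_sample_test(inflows_by_year, optimize_policy, simulate_objective,
                      train_frac=0.5, seed=0):
    # Shuffle whole hydrologic years so both halves sample wet and dry
    # periods, approximating "statistically equivalent" data sets.
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(inflows_by_year))
    n_train = int(train_frac * len(order))
    train = np.concatenate([inflows_by_year[y] for y in order[:n_train]])
    test = np.concatenate([inflows_by_year[y] for y in order[n_train:]])
    params = optimize_policy(train)  # calibrate on the first data set
    return {"train": simulate_objective(params, train),
            "test": simulate_objective(params, test)}  # out-of-sample score
```

A large gap between the train and test scores is the overfitting symptom the quoted passage warns about.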
“…To mitigate this limitation, previous studies have optimized control policies using sufficiently long training sequences drawn from synthetically generated scenarios (de la Cruz Courtois et al., 2021; Salazar et al., 2017; Tsoukalas and Makropoulos, 2015). Other works have adopted an additional validation sequence to improve the generalization of optimized control policies to synthetic scenarios not used in training (Brodeur et al., 2020; Quinn et al., 2017).…”
Section: Introduction (mentioning; confidence: 99%)
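Extending that train/validate pattern to a leave-scenarios-out loop, in the spirit of the cross-validation named in the paper's title, might look like the following sketch; the interfaces are assumed, not taken from the cited studies.

```python
"""Sketch of hold-out validation across synthetic inflow scenarios.
`optimize_policy` and `simulate_objective` are hypothetical stand-ins."""
import numpy as np

def cross_validate_scenarios(scenarios, optimize_policy,
                             simulate_objective, k=5):
    """scenarios: list of synthetic inflow arrays. Returns one
    out-of-sample score per fold of held-out scenarios."""
    folds = np.array_split(np.arange(len(scenarios)), k)
    scores = []
    for held_out in folds:
        held = set(held_out.tolist())
        train = np.concatenate(
            [s for i, s in enumerate(scenarios) if i not in held])
        params = optimize_policy(train)
        # Evaluate only on scenarios never seen during optimization.
        scores.append(float(np.mean(
            [simulate_objective(params, scenarios[i]) for i in held_out])))
    return scores
```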
“…This limitation extends to hydrologic forecasts forced by those climate hindcasts (Demargne et al., 2014). Thus, a fairly short time period of available hindcasts (at most ∼40 years) must be parsed into even smaller periods to enable calibration and testing of policies, creating the potential for overfitting and poor out‐of‐sample performance (Brodeur et al., 2020; Herman et al., 2020; Nayak et al., 2018). Synthetic forecasts offer a solution to this challenge. They are generated by adding random error to observational records, such that the resulting series is statistically indistinguishable from forecasts developed using a physically based model.…”
Section: Introduction (mentioning; confidence: 99%)
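The error-perturbation idea in the last two sentences can be sketched as follows; the lognormal error model, its lead-time scaling, and all parameter values are illustrative assumptions rather than the cited authors' method.

```python
"""Sketch of synthetic forecast generation: perturb the observed record
with random error so the result mimics a model-based forecast."""
import numpy as np

def synthetic_forecasts(observed, n_leads=7, base_cv=0.1, seed=0):
    """observed: 1-D array of flows. Returns an (n_days, n_leads) array
    of synthetic forecasts, with error growing at longer lead times."""
    rng = np.random.default_rng(seed)
    out = np.empty((observed.size, n_leads))
    for lead in range(n_leads):
        sigma = base_cv * (lead + 1)  # wider spread at longer leads
        # mean = -sigma^2/2 makes E[noise] = 1, so forecasts are unbiased.
        noise = rng.lognormal(mean=-0.5 * sigma**2, sigma=sigma,
                              size=observed.size)
        out[:, lead] = observed * noise
    return out
```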