2021
DOI: 10.1029/2020wr029471
|View full text |Cite
|
Sign up to set email alerts
|

Reliability Assessment of Machine Learning Models in Hydrological Predictions Through Metamorphic Testing

Abstract: The reliability of the machine learning model prediction for a given input can be assessed by comparing it against the actual output. However, in hydrological studies, machine learning models are often adopted to predict future or unknown events, where the actual outputs are unavailable. The prediction accuracy of a model, which measures its average performance across an observed data set, may not be relevant for a specific input. This study presents a method based on metamorphic testing (MT), adopted from sof… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 11 publications
(7 citation statements)
references
References 74 publications
0
7
0
Order By: Relevance
“…In a VHL, catchment characteristics can be assigned to cover a range of likely values, and VHL simulations used to create and test CMs that can then be more broadly applied. Testing the VHL could focus on behavioral modeling (Schaefli et al., 2011; Yang & Chui, 2021) to ensure that relationships between drivers of change and hydrologic responses are consistent with the four lines of evidence rather than using catchment specific performance metrics for single hydrologic variables (e.g., Nash Sutcliffe efficiency between observed and modeled flow) that rely heavily on having complete and accurate data sets to parameterize the model and define boundary conditions. Note that in this VHL approach, selection of suitable “trustworthy” PRMs for the particular change experiment of interest will be crucial—see Section 3.3 for further discussion.…”
Section: The Value Of “Virtual Hydrological Laboratories” As Another ...mentioning
confidence: 99%
“…In a VHL, catchment characteristics can be assigned to cover a range of likely values, and VHL simulations used to create and test CMs that can then be more broadly applied. Testing the VHL could focus on behavioral modeling (Schaefli et al., 2011; Yang & Chui, 2021) to ensure that relationships between drivers of change and hydrologic responses are consistent with the four lines of evidence rather than using catchment specific performance metrics for single hydrologic variables (e.g., Nash Sutcliffe efficiency between observed and modeled flow) that rely heavily on having complete and accurate data sets to parameterize the model and define boundary conditions. Note that in this VHL approach, selection of suitable “trustworthy” PRMs for the particular change experiment of interest will be crucial—see Section 3.3 for further discussion.…”
Section: The Value Of “Virtual Hydrological Laboratories” As Another ...mentioning
confidence: 99%
“…In the hydraulic field, MT has been successfully applied to test the storm water management model (SWMM) systems [19] and dynamical hydraulic systems model [40]. Lin et al [19] have proposed four MRs related to the change of correlation coefficients ( 2 ), which help detect real-life defects in the SWMM system that is used to simulate the dynamic rainfall-runoff models for estimating the storm water runoffs in urban areas.…”
Section: Related Workmentioning
confidence: 99%
“…Lin et al [19] have proposed four MRs related to the change of correlation coefficients ( 2 ), which help detect real-life defects in the SWMM system that is used to simulate the dynamic rainfall-runoff models for estimating the storm water runoffs in urban areas. The work of Yang and Chui [40] focuses on testing machine learning models which predict flood events in Germany. They have proposed three MRs and adopted them to test 13 machine learning models.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…We will do this by considering isolated changes in precipitation and temperature and compare the outlet discharge with expected outcomes for selected basins from the CAMELS data set (Newman et al, 2015;Addor et al, 2017) with different characteristics (low-elevation, warm catchments vs. very high elevation, snow-dominated catchments). Note that this is a metamorphic testing design (Xie et al, 2011;Yang and Chui, 2021) that facilitates the formulation of the qualitative expected behavior, rather than a realistic climate change scenario that would consist of coupled temperature and precipitation changes. Based on this design, the more specific goals of our study are to answer the following questions:…”
Section: Introductionmentioning
confidence: 99%