“…Comparing the performance of different models can help both practitioners and developers to improve existing models by learning from other modeling concepts (Kollet et al, 2017) or calibration approaches (e.g., Freyberg, 1988). This is commonly mentioned as a reason why hydrologists should be interested in machine learning models (e.g., Haaf et al, 2023;Nolte et al, 2023;Kratzert et al, 2019), as they may result in new knowledge that in turn may be used to improve empirical and process-based groundwater models. Several studies have compared models to simulate head time series (e.g., Sahoo and Jha, 2013;Shapoori et al, 2015;Wunsch et al, 2021;Zarafshan et al, 2023;Vonk et al, 2024).…”