“…Since 2017, it has been demonstrated that deep learning (DL) networks (Shen, 2018; Shen et al., 2018), like long short‐term memory (LSTM; Hochreiter & Schmidhuber, 1997), can learn to predict environmental variables including stream temperature with exceptionally high accuracy (Rahmani, Lawson, et al., 2021; Rahmani, Shen, et al., 2021; Rehana & Rajesh, 2023; Sadler et al., 2022; Weierbach et al., 2022; S. Zhu & Piotrowski, 2020; Zwart et al., 2023). LSTM has shown its strength and versatility in simulating hydrologic variables such as soil moisture (Fang et al., 2017, 2019; J. Liu et al., 2022; O & Orth, 2021), streamflow (Feng et al., 2020, 2021; Khoshkalam et al., 2023; Xiang et al., 2020), dissolved oxygen (Heddam et al., 2022; Zhi et al., 2023), snow water equivalent (Broxton et al., 2019; Meyal et al., 2020), stream nitrate concentration (Saha et al., 2023; Samarinas et al., 2020), and radiation (Y. Liu et al., 2020; F. Zhu et al., 2021). However, mostly used as a forward simulator in hydrology, LSTM is also limited by its black‐box nature: it does not provide an interpretable explanation of internal processes, and its intermediate variables lack physical meaning and thus cannot be compared against observations to diagnose the model's internal logic (Appling et al., 2022).…”