“…Several techniques for achieving autonomy in rescue operations have been developed in recent years, with reinforcement learning (RL) being one of the prominent methods. In recent years, RL techniques have achieved remarkable success across various domains, including network security, biological applications, and robotics (Alali and Imani, 2023 , 2024 ; Elguea-Aguinaco et al, 2023 ; Ravari et al, 2023 ; Alali et al, 2024 ; Asadi et al, 2024 ). For autonomy in rescue operations, various RL techniques have been developed for single-agent and multi-agent settings (Imanberdiyev et al, 2016 ; Zhang et al, 2018 ; Bøhn et al, 2019 ; Lin et al, 2019 ; Niroui et al, 2019 ; Sampedro et al, 2019 ; Ebrahimi et al, 2020 ; Hu et al, 2020 ; Wu et al, 2021 ).…”