“…The conventional RL has certain limitations when dealing with anti‐jamming systems with a large state‐action space. To tackle these challenges, some recent work has proposed using deep reinforcement learning (DRL) [52, 53, 95–100] as shown in Table 4. The DRL is a branch of DL where it uses deep artificial neural networks to enhance the learning operation of the traditional RL.…”