“…What is more, reinforcement learning, a vital branch of machine learning, is capable of ensuring anti-jamming communications based on joint actions executed by the WSN and the feedback from the external environment, without the need for jamming modeling [ 8 ]. Anti-jamming communication schemes based on reinforcement learning have already been extensively studied [ 2 , 7 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 ]. In reference [ 7 ], a SARSA-based anti-jamming algorithm was proposed to counter random pulse jamming in a time domain; however, most of the current research focuses on interference patterns with frequency domain characteristics.…”