“…Due to the powerful nonlinear fitting capabilities, neural networks have been widely applied in infrared and visible image fusion, achieving performance far superior to traditional methods. Currently, the methods of infrared and visible image fusion based on deep learning can generally be divided into four types: CNN-based methods [ 33 , 34 , 35 , 36 ], GAN-based methods [ 37 , 38 , 39 , 40 , 41 ], AE-based [ 1 , 42 , 43 , 44 , 45 ] methods, and transformer-based [ 32 , 46 , 47 , 48 , 49 ] methods. CNN-based methods tend to focus on the design of loss functions, forcing the model to generate images that contain as much information from the source images as possible.…”