As an emerging geophysical exploration technology in recent years, airborne electromagnetic exploration has the advantages of adapting to diverse terrains, wide coverage, and providing a large amount of electromagnetic data, and can be applied to the rapid collection of large amounts of data. Scenarios are often used in fields such as deep geological structures, mineral resource exploration, and environmental engineering research. However, traditional airborne electromagnetic data inversion technology usually takes a long time to process a large amount of airborne electromagnetic data, and it is difficult to remove the noise in the later signals. Therefore, this paper proposes a multi-task learning network structure based on Transformer. By constraining the two network branches of imaging and denoising, a sub-network with simultaneous denoising and imaging is established to process aeronautical electromagnetic data. The noise test set is introduced for testing. This model achieved a 582.61% signal-to-noise ratio improvement in smooth Gaussian noise denoising, and a 129.69% and 112.74% signal-to-noise ratio improvement in non-smooth Gaussian noise and random impulse noise denoising, respectively. The method proposed in this article overcomes the shortcomings of traditional inversion imaging such as slow speed and low resolution, and at the same time eliminates the influence of noise in airborne electromagnetic data. This is of great significance for the application of deep learning in the field of geophysical exploration.