For the pulse shaping system of the SG-II-up facility, we propose a U-shaped convolutional neural network that integrates multi-scale feature extraction capabilities, an attention mechanism and long short-term memory units, which effectively facilitates real-time denoising of diverse shaping pulses. We train the model using simulated datasets and evaluate it on both the simulated and experimental temporal waveforms. During the evaluation of simulated waveforms, we achieve high-precision denoising, resulting in great performance for temporal waveforms with frequency modulation-to-amplitude modulation conversion (FM-to-AM) exceeding 50%, exceedingly high contrast of over 300:1 and multi-step structures. The errors are less than 1% for both root mean square error and contrast, and there is a remarkable improvement in the signal-to-noise ratio by over 50%. During the evaluation of experimental waveforms, the model can obtain different denoised waveforms with contrast greater than 200:1. The stability of the model is verified using temporal waveforms with identical pulse widths and contrast, ensuring that while achieving smooth temporal profiles, the intricate details of the signals are preserved. The results demonstrate that the denoising model, trained utilizing the simulation dataset, is capable of efficiently processing complex temporal waveforms in real-time for experiments and mitigating the influence of electronic noise and FM-to-AM on the time–power curve.