Abstract-Calculating the electromagnetic (EM) fields scattered from a three-layer canopy imposes a heavy computational burden when the simulated area becomes large, which severely limits the applicability of the traditional serial algorithm. With the development of graphics hardware, the Graphics Processing Unit (GPU) can be used to solve EM scattering problems in parallel. In this paper, the Compute Unified Device Architecture (CUDA) is combined with the four-path method and the reciprocity theorem to accelerate the calculation of the EM scattering properties of a three-layer canopy, with the simulation based on the Monte Carlo method. We obtain a maximum speedup of 294 times over the original serial algorithm, using a Core(TM) i5 CPU with a GTS460 GPU as a coprocessor.