“…This GAN model takes both image and a text that describes an object to generate a new image containing this object. Also, MRP‐GAN (Qi, Fan, et al, 2021), SAM‐GAN (Peng et al, 2021), DM‐GAN (M. Zhu, Pan, et al, 2019), DAE‐GAN (Ruan et al, 2021), KT‐GAN (Tan et al, 2021), Bridge‐GAN (M. Yuan & Peng, 2020), CF‐GAN (Y. Zhang, Han, et al, 2022), DGattGAN (H. Zhang, Zhu, et al, 2021), PCCM‐GAN (Qi, Sun, et al, 2021), aRTIC GAN (Alati et al, 2022), and CDRGAN (M. Wang et al, 2021) were proposed to generate natural images based on a descriptive texts that describe these images. Likewise, Y. Zhou (2021), M. Z. Khan et al (2021), and Y. Zhou and Shimada (2021) proposed GAN models to synthesize face images based on the text describing these faces.…”