RETRACTED ARTICLE: A comprehensive survey on generative adversarial networks used for synthesizing multimedia content

Kumar, Lalit; Singh, Dushyant Kumar

doi:10.1007/s11042-023-15138-x

Cited by 13 publications

(5 citation statements)

References 42 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…These networks undergo adversarial training, where the generator aims to create images capable of deceiving the discriminator, while the discriminator strives to accurately discern the generated images. GANs have proven effective in generating top‐notch facial images, with notable advancements seen in recent GAN variations like StyleGAN and BigGAN, showcasing impressive outcomes in this domain 3,5 …”

Section: Related Workmentioning

confidence: 99%

“…In this application, a GAN consists of a generator network trained to create realistic facial images and a discriminator network trained to distinguish between authentic faces and those generated by the generator. Through an adversarial training process, the generator refines its ability to produce increasingly convincing facial images, while the discriminator simultaneously improves its capability to discern real from generated faces 2,3 . This dynamic interplay results in the generation of high‐quality and natural‐looking face images, making GANs a powerful tool for tasks such as face synthesis, augmentation, and facial expression manipulation.…”

Section: Introductionmentioning

confidence: 99%

“…Furthermore, GANs cannot guarantee the confidentiality and security of the generated images, as the generator may generate images of real people. Therefore, it is important to consider the ethical implications of using GANs to generate images of people and obtain consent from images used in the training dataset 3 . This manuscript is totally based on realistic face image generation for automated content creation of Video Logs.…”

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Diversified realistic face image generationGANfor human subjects in multimedia content creation

Kumar,

Singh

2024

Computer Animation & Virtual

Self Cite

View full text Add to dashboard Cite

Face image generation plays an important role in generating innovative and unique multimedia content using the GAN model. With these qualities of the GAN model, they have numerous challenges in the human face image generation. The problems encountered in the generation of facial images are like blurriness in images, incomplete details in the generated facial images, high computational power requirements, and so forth. In this manuscript, we proposed a GAN model that utilizes the composite strength of VGG‐16 and ResNet‐50's models to overcome those difficulties. It uses VGG‐16 to build a discriminator model to discriminate between real and fake images. The generator model utilizes a combination of components from the ResNet‐50 and VGG‐16 models to enhance the image generation process at each iteration, resulting in the creation of realistic face images. The proposed DRFI GAN (Diversified and Realistic Face Image Generation GAN) model's generator achieves an impressive low FID score of 20.50, which is less than existing state‐of‐the‐art approaches. Furthermore, our findings indicate that the images generated by the DRFI GAN model exhibit 10%–15% greater efficiency and realism with reduced training time compared to existing state‐of‐the‐art methods with lower FID scores.

show abstract

Section: Related Workmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 1 more Smart Citation

Diversified realistic face image generationGANfor human subjects in multimedia content creation

Kumar,

Singh

2024

Computer Animation & Virtual

Self Cite

View full text Add to dashboard Cite

show abstract

“…Teknik yang digunakan antara lain generative adversarial networks, variational autoencoders, transformer-based model, dan deep generative models lainnya untuk mensintesis dan memanipulasi gambar fashion dari teks [78].…”

Section: Oxford 102 Flowerunclassified

Sintesis Teks Ke Gambar: Tinjauan Atas Dataset

Arifin

2024

EEICT

View full text Add to dashboard Cite

Penelitian ini bertujuan untuk melakukan analisis mendalam terhadap berbagai dataset yang digunakan dalam riset sintesis teks ke gambar. Fokus utama penelitian ini adalah pada pemahaman karakteristik masing-masing dataset, pengaruh pemilihan dataset terhadap hasil penelitian, serta keunggulan dan kelemahan setiap dataset. Beberapa dataset yang diteliti meliputi MS COCO, CUB-200-2011, dan Oxford 102 Flower, bersama dengan dataset-domain khusus lainnya yang relevan. Metode penelitian mencakup analisis deskriptif terhadap jumlah gambar, karakteristik visual, dan deskripsi teks yang melibatkan setiap dataset. Data yang diperoleh dianalisis secara kualitatif untuk mendapatkan wawasan mendalam tentang setiap dataset. Hasil analisis diharapkan dapat memberikan panduan bagi peneliti dalam memilih dataset yang sesuai dengan tujuan penelitian mereka dalam sintesis teks ke gambar. Penelitian ini diakhiri dengan rekomendasi dan kesimpulan yang merangkum temuan utama dan relevansinya dalam konteks riset ini.

show abstract

“…Both use a machine learning approach, because it can shorten the time in analyzing and observing the development of a disease [9]. Moreover, machine learning can automatically make it easier to process large amounts of data, including attributes of vocal sounds [10]. The sound extraction in this study was used as a data attribute for the learning process of a detection system with a machine learning approach.…”

Section: Introductionmentioning

confidence: 99%

Hilbert-Schmidt Independence Criterion Lasso Feature Selection in Parkinson’s Disease Detection System

Wiharto,

Sucipto,

Salamah

2023

IJFIS

View full text Add to dashboard Cite

Parkinson's disease is a neurological disorder which interferes human activities. Early detection is needed to facilitate treatment before the symptoms get worse. Earlier detection used vocal voice as a comparison with normal subject. However, detection using vocal voice still has weaknesses in detection system. Vocal voice contains a lot of information that isn't necessarily relevant for a detection system. Previous studies proposed a feature selection method on detection system. However, the proposed method can't handle variation in the amount of data. These variations include an imbalance sample to features and classes. In answering these problems, the Hilbert-Schmidt Independence Criterion Lasso (HSIC Lasso) feature selection method is used which has feature transformation capabilities that can produce more relevant features. In addition, detection system uses Synthetic Minority Oversampling Technique (SMOTE) method to balance data and several classification methods such as knearest neighbors, support vector machine, and multilayer perceptron to obtain best predictive model. HSIC Lasso produces 18 of 45 features with an accuracy of 88.34% on a small sample and 50 of 754 features with an accuracy of 96.16% on a large sample. From this result, when compared with previous studies, HSIC Lasso is more suitable on balanced data with more samples and features.

show abstract

RETRACTED ARTICLE: A comprehensive survey on generative adversarial networks used for synthesizing multimedia content

Cited by 13 publications

References 42 publications

Diversified realistic face image generationGANfor human subjects in multimedia content creation

Diversified realistic face image generationGANfor human subjects in multimedia content creation

Sintesis Teks Ke Gambar: Tinjauan Atas Dataset

Hilbert-Schmidt Independence Criterion Lasso Feature Selection in Parkinson’s Disease Detection System

Contact Info

Product

Resources

About