Which Human Faces Can an Ai Generate? Lack of Diversity in This Person
        Does Not Exist

Sequeira, Lucas Nunes; Moreschi, Bruno; Santos, Vinicius Ariel Arruda dos

doi:10.5210/spir.v2021i0.12240

Cited by 2 publications

(2 citation statements)

References 0 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Text-to-image generation models, in particular, have the potential to extend image-editing capabilities and lead to the development of new tools for creative practitioners. On the other hand, generative methods can be leveraged for malicious purposes, including harassment and misinformation spread [20], and raise many concerns regarding social and cultural exclusion and bias [67,62,68]. These considerations inform our decision to not to release code or a public demo.…”

Section: Conclusion Limitations and Societal Impactmentioning

confidence: 99%

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Saharia¹,

Chan²,

Saxena³

et al. 2022

Preprint

232

312

View full text Add to dashboard Cite

We present Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding. Imagen builds on the power of large transformer language models in understanding text and hinges on the strength of diffusion models in high-fidelity image generation. Our key discovery is that generic large language models (e.g. T5), pretrained on text-only corpora, are surprisingly effective at encoding text for image synthesis: increasing the size of the language model in Imagen boosts both sample fidelity and imagetext alignment much more than increasing the size of the image diffusion model. Imagen achieves a new state-of-the-art FID score of 7.27 on the COCO dataset, without ever training on COCO, and human raters find Imagen samples to be on par with the COCO data itself in image-text alignment. To assess text-to-image models in greater depth, we introduce DrawBench, a comprehensive and challenging benchmark for text-to-image models. With DrawBench, we compare Imagen with recent methods including VQ-GAN+CLIP, Latent Diffusion Models, GLIDE and DALL-E 2, and find that human raters prefer Imagen over other models in side-byside comparisons, both in terms of sample quality and image-text alignment. See imagen.research.google for an overview of the results. * Equal contribution. † Core contribution.

show abstract

Section: Conclusion Limitations and Societal Impactmentioning

confidence: 99%

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Saharia¹,

Chan²,

Saxena³

et al. 2022

Preprint

232

312

View full text Add to dashboard Cite

show abstract

“…Instead of looking at these results as failures, we rather suggest to embrace those 'anticipated' or 'biased' outputs. As these models are trained on real-world datasets and thus incorporate human biases [67,72], they can help to surface, identify, and confront existing assumptions and preconceptions. As such, using generative AI models can be an effective approach to make robot and social stereotypes visible [2] in order to then challenge them through designerly action.…”

Section: Discussionmentioning

confidence: 99%

Creative AI for HRI Design Explorations

Hoggenmueller

Lupetti

Maden

et al. 2023

Companion of the 2023 ACM/IEEE International Conference on Human-Robot Interaction

View full text Add to dashboard Cite

Design fixation, a phenomenon describing designers' adherence to pre-existing ideas or concepts that constrain design outcomes, is particularly prevalent in human-robot interaction (HRI), for example, due to collectively held and stabilised imaginations of what a robot should look like or behave. In this paper, we explore the contribution of creative AI tools to overcome design fixation and enhance creative processes in HRI design. In a four weeks long design exploration, we used generative text-to-image models to ideate and visualise robotic artefacts and robot sociotechnical imaginaries. We exchanged results along with reflections through a digital postcard format. We demonstrate the usefulness of our approach to imagining novel robot concepts, surfacing existing assumptions

show abstract

Which Human Faces Can an Ai Generate? Lack of Diversity in This Person Does Not Exist

Cited by 2 publications

References 0 publications

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

Creative AI for HRI Design Explorations

Contact Info

Product

Resources

About