Jonathan Ho scite author profile

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

We present Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding. Imagen builds on the power of large transformer language models in understanding text and hinges on the strength of diffusion models in high-fidelity image generation. Our key discovery is that generic large language models (e.g. T5), pretrained on text-only corpora, are surprisingly effective at encoding text for image synthesis: increasing the size of the language model in Imagen boosts both sample fidelity and image-text alignment much more than increasing the size of the image diffusion model. Imagen achieves a new state-of-the-art FID score of 7.27 on the COCO dataset, without ever training on COCO, and human raters find Imagen samples to be on par with the COCO data itself in image-text alignment. To assess text-to-image models in greater depth, we introduce DrawBench, a comprehensive and challenging benchmark for text-to-image models. With DrawBench, we compare Imagen with recent methods including VQ-GAN+CLIP, Latent Diffusion Models, GLIDE and DALL-E 2, and find that human raters prefer Imagen over other models in side-by-side comparisons, both in terms of sample quality and image-text alignment. See imagen.research.google for an overview of the results.

The Impact of Acquisitions on Operating Performance: Some Australian Evidence

Sharma

2002

Business Fin & Account

173

214

This study investigates the impact of acquisitions on the operating performance of Australian firms. For a sample of 36 Australian acquisitions occurring between 1986 and 1991 inclusive, and using matched firms to control for industry and economy-wide factors, the results based on four accrual and four cash flow performance measures show that corporate acquisitions do not lead to significant improvements in post-acquisition operating performance. The consistency of the results with the agency, the hubris and the financial motivation hypotheses suggests that corporate acquisitions in Australia may be undertaken for other than synergistic reasons. The results assist in explaining inconsistent findings reported in the literature. Copyright Blackwell Publishers Ltd 2002.

The effects of product-related, personal-related factors and attractiveness of alternatives on consumer adoption of NFC-based mobile payments

Pham

2015

Technology in Society

240

197

Classifier-Free Diffusion Guidance

Ho¹,

Salimans²

2022

Preprint

157

153

Classifier guidance is a recently introduced method to trade off mode coverage and sample fidelity in conditional diffusion models post training, in the same spirit as low temperature sampling or truncation in other types of generative models. Classifier guidance combines the score estimate of a diffusion model with the gradient of an image classifier and thereby requires training an image classifier separate from the diffusion model. It also raises the question of whether guidance can be performed without a classifier. We show that guidance can be indeed performed by a pure generative model without such a classifier: in what we call classifier-free guidance, we jointly train a conditional and an unconditional diffusion model, and we combine the resulting conditional and unconditional score estimates to attain a trade-off between sample quality and diversity similar to that obtained using classifier guidance.
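
The abstract describes combining conditional and unconditional score estimates but does not give the formula. As a minimal sketch, assuming the commonly used linear combination (1 + w) * conditional - w * unconditional with guidance weight w, the mixing step could look like the following. The function name `guided_estimate` and the toy values are hypothetical and stand in for the outputs of a trained model at one sampling step.

```python
from typing import List, Sequence


def guided_estimate(cond: Sequence[float],
                    uncond: Sequence[float],
                    w: float) -> List[float]:
    """Mix two score (or noise) estimates element-wise.

    Assumed form: (1 + w) * cond - w * uncond.
    w = 0 returns the conditional estimate unchanged; larger w pushes
    the result further away from the unconditional estimate.
    """
    if len(cond) != len(uncond):
        raise ValueError("estimates must have the same length")
    return [(1.0 + w) * c - w * u for c, u in zip(cond, uncond)]


# Toy values (hypothetical) standing in for the two model outputs.
eps_cond = [1.0, 2.0, -1.0]
eps_uncond = [0.5, 2.0, 0.0]

no_guidance = guided_estimate(eps_cond, eps_uncond, w=0.0)
strong_guidance = guided_estimate(eps_cond, eps_uncond, w=2.0)
print(no_guidance)
print(strong_guidance)
```

In this sketch the weight w plays the role of the quality-versus-diversity knob mentioned in the abstract: components where the two estimates agree are left unchanged, while components where they differ are amplified as w grows.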

Image Super-Resolution via Iterative Refinement

Saharia¹,

Ho²,

Chan³

et al. 2021

Preprint

130
