2023
DOI: 10.48550/arxiv.2303.10834
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Object-Centric Slot Diffusion

Abstract: Despite remarkable recent advances, making object-centric learning work for complex natural scenes remains the main challenge. The recent success of adopting the transformer-based image generative model in object-centric learning suggests that having a highly expressive image generator is crucial for dealing with complex scenes. In this paper, inspired by this observation, we aim to answer the following question: can we benefit from the other pillar of modern deep generative models, i.e., the diffusion models,… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 40 publications
0
1
0
Order By: Relevance
“…Text-to-image diffusion models Diffusion models [10,19,21,41,[58][59][60][61][62] have proven to be highly effective in learning data distributions and have shown impressive results in image synthesis, leading to various applications [8,26,27,29,31,32,36,46,56,74]. Recent advancements have also explored transformer-based architectures [6,45,67].…”
Section: Related Workmentioning
confidence: 99%
“…Text-to-image diffusion models Diffusion models [10,19,21,41,[58][59][60][61][62] have proven to be highly effective in learning data distributions and have shown impressive results in image synthesis, leading to various applications [8,26,27,29,31,32,36,46,56,74]. Recent advancements have also explored transformer-based architectures [6,45,67].…”
Section: Related Workmentioning
confidence: 99%