Diffusion in Style
We present Diffusion in Style, a simple method to adapt Stable Diffusion to any desired style, using only a small set of target images. It is based on the key observation that the style of the images generated by Stable Diffusion is tied to the initial latent tensor. Not adapting this initial latent tensor to the style makes fine-tuning slow, expensive, and impractical, especially when only a few target style images are available. In contrast, fine-tuning is much easier if this initial latent tensor is also adapted. Our Diffusion in Style is orders of magnitude more sample-efficient and faster. It also generates more pleasing images than existing approaches, as shown qualitatively and with quantitative comparisons.
Everaert_Diffusion_in_Style_ICCV_2023_paper.pdf
postprint
openaccess
n/a
8.61 MB
Adobe PDF
ea841b9be02d4e77eb923fbbd1d06cf3
Everaert_Diffusion_in_Style_ICCV_2023_supplemental.pdf
postprint
openaccess
n/a
29.52 MB
Adobe PDF
f6fa0b5d305f886f1359c52d1c135cb9