Diffusion in Style

Everaert, Martin Nicolas; Bocchio, Marco; Arpa, Sami; Süsstrunk, Sabine; Achanta, Radhakrishna

doi:10.1109/ICCV51070.2023.00214

conference paper

October 2023

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV23)

IEEE/CVF International Conference on Computer Vision (ICCV23)

We present Diffusion in Style, a simple method to adapt Stable Diffusion to any desired style using only a small set of target images. It is based on the key observation that the style of the images generated by Stable Diffusion is tied to the initial latent tensor. Without adapting this initial latent tensor to the style, fine-tuning is slow, expensive, and impractical, especially when only a few target style images are available. In contrast, fine-tuning is much easier if the initial latent tensor is also adapted. Diffusion in Style is orders of magnitude more sample-efficient and faster than existing approaches. It also generates more pleasing images, as shown by qualitative and quantitative comparisons.
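The abstract does not spell out how the initial latent tensor is adapted. As an illustration only, the sketch below assumes one plausible reading: estimate an element-wise mean and standard deviation from the latent encodings of the target style images, then draw the initial latent from that distribution instead of from a standard normal. The function names, the flattened-list representation, and the random stand-in data are all hypothetical, and no Stable Diffusion model is involved.

```python
import random
import statistics


def fit_style_latent_stats(style_latents):
    """Element-wise mean and standard deviation over flattened latents.

    style_latents: list of equal-length lists of floats, one per style image.
    (Assumption: the latents were produced beforehand by an image encoder.)
    """
    columns = list(zip(*style_latents))  # one tuple per latent element
    mean = [statistics.fmean(col) for col in columns]
    std = [statistics.pstdev(col) for col in columns]
    return mean, std


def sample_initial_latent(mean, std, rng):
    """Draw a style-adapted initial latent: mean + std * eps, eps ~ N(0, 1)."""
    return [m + s * rng.gauss(0.0, 1.0) for m, s in zip(mean, std)]


# Stand-in data: 8 fake "style latents" with 16 elements each.
# A real latent would be far larger; these values are purely illustrative.
rng = random.Random(0)
LATENT_SIZE = 16
style_latents = [
    [rng.gauss(0.5, 2.0) for _ in range(LATENT_SIZE)] for _ in range(8)
]

mean, std = fit_style_latent_stats(style_latents)

# Default initialization: standard normal noise, independent of the style.
default_latent = [rng.gauss(0.0, 1.0) for _ in range(LATENT_SIZE)]

# Style-adapted initialization under the assumption stated above.
styled_latent = sample_initial_latent(mean, std, rng)
```

In this reading, `styled_latent` would replace `default_latent` as the starting point of the denoising process, both during fine-tuning and at generation time. Whether the paper uses exactly these statistics is not stated in the abstract.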

Name: Everaert_Diffusion_in_Style_ICCV_2023_paper.pdf
Type: Postprint
Version: http://purl.org/coar/version/c_ab4af688f83e57aa
Access type: openaccess
License Condition: n/a
Size: 8.61 MB
Format: Adobe PDF
Checksum (MD5): ea841b9be02d4e77eb923fbbd1d06cf3