Multi-Concept Customization of Text-to-Image Diffusion

1CMU    2Tsinghua University    3Adobe Research

CVPR 2023


We release a dataset of 101 concepts, with 3-15 images per concept, for evaluating model customization methods. The target real images for each concept are shown below.

We introduce both single-concept and multi-concept settings, with evaluation text prompts for each case. Below, we show random samples from our method (Custom Diffusion), DreamBooth, and Textual Inversion for each concept. Scroll horizontally to see all samples with different test prompts.

Dataset and prompt creation: we sourced images from Unsplash or collected them ourselves for concepts across a variety of categories, namely toys, plushies, wearables, scenes, transport vehicles, furniture, home decor items, luggage, human faces, musical instruments, rare flowers, food items, and pet animals. To create evaluation prompts, we first used ChatGPT to generate 40 image captions for each concept, with instructions to either (1) change the background while keeping the main subject, (2) insert a new object or living thing in the scene along with the main subject, (3) vary the style of the main subject, or (4) change the property or material of the main subject. The generated text prompts were then manually filtered or modified to obtain the final 20 prompts for each concept. A similar strategy is applied for multiple concepts. Some of the prompts are also inspired by other concurrent works, e.g., Perfusion, DreamBooth, SuTI, and BLIP-Diffusion.
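As a rough illustration of the prompt-creation procedure described above, the sketch below assembles a caption request covering the four edit types and then trims a candidate list to the final 20. It is not the script used to build the dataset: the request wording, the function names `build_request` and `select_final`, and the `keep` predicate (a stand-in for the manual filtering step) are all assumptions made for this example.

```python
from typing import Callable

# The four edit types named in the text. The exact wording sent to ChatGPT
# is not given on this page, so these strings are illustrative.
EDIT_INSTRUCTIONS = [
    "change the background while keeping the main subject",
    "insert a new object or living thing in the scene along with the main subject",
    "vary the style of the main subject",
    "change the property or material of the main subject",
]

N_GENERATED = 40  # captions requested per concept
N_FINAL = 20      # prompts kept per concept after filtering


def build_request(concept: str, n_captions: int = N_GENERATED) -> str:
    """Assemble a hypothetical caption request for one concept."""
    lines = [
        f"Write {n_captions} image captions featuring a {concept}.",
        "Each caption should do one of the following:",
    ]
    for i, instruction in enumerate(EDIT_INSTRUCTIONS, start=1):
        lines.append(f"({i}) {instruction}")
    return "\n".join(lines)


def select_final(
    candidates: list[str],
    keep: Callable[[str], bool],
    n_final: int = N_FINAL,
) -> list[str]:
    """Drop empty and duplicate captions, apply `keep`, return the first n_final.

    `keep` stands in for the manual review described in the text.
    """
    seen: set[str] = set()
    kept: list[str] = []
    for caption in candidates:
        caption = caption.strip()
        key = caption.lower()
        if caption and key not in seen and keep(caption):
            seen.add(key)
            kept.append(caption)
    return kept[:n_final]
```

In practice the filtering here was done by hand, so `keep` would be replaced by a human pass over the generated captions rather than an automatic rule.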

License: Images taken from Unsplash are under the Unsplash License. Images collected by us are released under the CC BY-SA 4.0 license. Flower category images were downloaded from Wikimedia/Flickr/Pixabay, and links to the original images can also be found here.



Please refer to our code for details regarding dataset download, text prompts, and evaluation code for single-concept and multi-concept customization.
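After downloading, a quick sanity check against the figures stated on this page (101 concepts, 3-15 images each) might look like the sketch below. The directory layout (one sub-folder per concept under a root folder), the accepted image suffixes, and the function names are assumptions for illustration and may not match the released archive; the official code remains the reference.

```python
from pathlib import Path

# Assumed set of image file suffixes; adjust to the actual release.
IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png", ".webp"}
MIN_IMAGES, MAX_IMAGES = 3, 15  # per-concept range stated on this page
EXPECTED_CONCEPTS = 101         # concept count stated on this page


def count_images(root: Path) -> dict[str, int]:
    """Count image files in each concept folder (assumed layout: root/<concept>/*)."""
    counts: dict[str, int] = {}
    for concept_dir in sorted(p for p in root.iterdir() if p.is_dir()):
        counts[concept_dir.name] = sum(
            1
            for f in concept_dir.iterdir()
            if f.is_file() and f.suffix.lower() in IMAGE_SUFFIXES
        )
    return counts


def check_dataset(root: Path, expected_concepts: int = EXPECTED_CONCEPTS) -> list[str]:
    """Return human-readable problems; an empty list means the checks passed."""
    problems: list[str] = []
    counts = count_images(root)
    if len(counts) != expected_concepts:
        problems.append(
            f"found {len(counts)} concept folders, expected {expected_concepts}"
        )
    for name, n in counts.items():
        if not MIN_IMAGES <= n <= MAX_IMAGES:
            problems.append(
                f"{name}: {n} images, outside {MIN_IMAGES}-{MAX_IMAGES}"
            )
    return problems
```

Calling `check_dataset(Path("path/to/dataset"))` with a placeholder path returns a list of messages, which keeps the check easy to wire into a test or a download script.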

Gallery: one row per concept (101 in total). Each row shows the target image of the concept alongside samples from Custom Diffusion, DreamBooth, and Textual Inversion. Concepts are listed in page order, with the number of consecutive rows sharing the same name in parentheses:

Action Figure (2), Figurine, Houseplant (3), Lamp, Vase, Wooden Pot, Dish (2), Flower (2), Chair (3), Sofa (2), Table, Guitar Amplifier, Guitar (2), Violin, Earrings, Ring, Backpack, Purse (4), Jun-Yan Zhu, Richard Zhang, Eli Shechtman, Cat (7), Dog (4), Pokemon Plushie, Bunny Plushie, Cow Plushie, Dice Plushie, Plushie, Lobster Plushie, Panda Plushie, Penguin Plushie, Plushie (2), Teddybear, Tortoise Plushie, Unicorn Plushie, Barn, Canal Scene, Castle, Garden, Lighthouse, Sculpture, Waterfall, Book (2), Bottle, Corkscrew, Cup (3), Headphone (2), Helmet, Keychain, Bear, Toy Gnome, Pokemon Toy, Kids Table Chair, Toy, Bike, Car (11), Motorbike, Tank, Glasses, Jacket (2), Shoes (2), Sunglasses (2)



Acknowledgements

We are grateful to Sheng-Yu Wang, Songwei Ge, Daohan Lu, Ruihan Gao, Roni Shechtman, Avani Sethi, Yijia Wang, Shagun Uppal, and Zhizhuo Zhou for helping with the dataset collection, and to Nick Kolkin for feedback.