DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models

Zhao, Brian_Nlong; Xiao, Yuhang; Xu, Jiashu; Jiang, Xinyang; Yang, Yifan; Li, Dongsheng; Itti, Laurent; Vineet, Vibhav; Ge, Yunhao


The popularization of Text-to-Image (T2I) diffusion models enables the generation of high-quality images from text descriptions. However, generating diverse customized images with reference visual attributes remains challenging. This work focuses on personalizing T2I diffusion models at a more abstract concept or category level, adapting commonalities from a set of reference images while creating new instances with sufficient variations. We introduce a solution that allows a pretrained T2I diffusion model to learn a set of soft prompts, enabling the generation of novel images by sampling prompts from the learned distribution. These prompts offer text-guided editing capabilities and additional flexibility in controlling variation and mixing between multiple distributions. We also show the adaptability of the learned prompt distribution to other tasks, such as text-to-3D. Finally, we demonstrate the effectiveness of our approach through quantitative analysis, including automatic evaluation and human assessment.
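The sampling idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: it assumes, purely for illustration, that each learned soft prompt is flattened into a plain list of floats and that the prompt distribution is a per-dimension Gaussian. The names fit_prompt_distribution, sample_prompt, and mix_distributions are hypothetical, as are the scale and weight parameters standing in for "controlling variation" and "mixing between multiple distributions".

```python
import random
from statistics import fmean, pstdev


def fit_prompt_distribution(prompts):
    """Fit a per-dimension Gaussian (mean, std) to a set of soft prompts.

    Assumption: each prompt is a flat list of floats of equal length.
    """
    dims = list(zip(*prompts))  # regroup values by embedding dimension
    mean = [fmean(d) for d in dims]
    std = [pstdev(d) for d in dims]
    return mean, std


def sample_prompt(mean, std, scale=1.0, rng=random):
    """Draw one prompt as mean + scale * std * eps, with eps ~ N(0, 1).

    A larger `scale` gives more variation; scale=0 returns the mean prompt.
    """
    return [m + scale * s * rng.gauss(0.0, 1.0) for m, s in zip(mean, std)]


def mix_distributions(dist_a, dist_b, weight=0.5):
    """Blend two distributions by linearly interpolating their parameters.

    Assumption: linear interpolation is one simple way to mix; `weight`
    is the share given to dist_a.
    """
    (mean_a, std_a), (mean_b, std_b) = dist_a, dist_b
    mean = [weight * a + (1 - weight) * b for a, b in zip(mean_a, mean_b)]
    std = [weight * a + (1 - weight) * b for a, b in zip(std_a, std_b)]
    return mean, std
```

In a real pipeline the sampled vector would be reshaped into token embeddings and passed to the frozen text encoder of the diffusion model; that step is omitted here.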

Award ID(s): 2318101

PAR ID: 10536060

Author(s) / Creator(s): Zhao, Brian_Nlong; Xiao, Yuhang; Xu, Jiashu; Jiang, Xinyang; Yang, Yifan; Li, Dongsheng; Itti, Laurent; Vineet, Vibhav; Ge, Yunhao

Publisher / Repository: arXiv

Date Published: 2023-12-21

Format(s): Medium: X

Institution: USC

Sponsoring Org: National Science Foundation

Posted Content: The DOI is not currently available.
