RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control

Rout, Litu; Chen, Yujia; Ruiz, Nataniel; Kumar, Abhishek; Caramanis, Constantine; Shakkottai, Sanjay; Chu, Wen-Sheng

Citation Details

The authors propose Reference-Based Modulation (RB-Modulation), a plug-and-play, training-free solution for personalization of diffusion models. Existing training-free methods face challenges in (a) extracting style from reference images without additional style or content text descriptions, (b) avoiding unwanted content leakage from style references, and (c) composing style and content effectively. RB-Modulation addresses these issues using a novel stochastic optimal controller, where a style descriptor encodes the desired attributes through a terminal cost. The induced drift ensures high fidelity to the reference style while adhering to the text prompt. Additionally, the authors introduce a cross-attention-based feature aggregation scheme that decouples content and style from the reference image. With both theoretical justification and empirical validation, RB-Modulation demonstrates precise control of content and style in a training-free manner, while enabling seamless composition, eliminating reliance on external adapters or ControlNets.

Award ID(s): 2505865

PAR ID: 10631949

Author(s) / Creator(s): Rout, Litu; Chen, Yujia; Ruiz, Nataniel; Kumar, Abhishek; Caramanis, Constantine; Shakkottai, Sanjay; Chu, Wen-Sheng

Publisher / Repository: https://doi.org/10.48550/arXiv.2405.17401

Date Published: 2024-05-27

Format(s): Medium: X

Sponsoring Org: National Science Foundation

