Enhancing Diffusion Models with RL & Adversarial Rewards
21.7% FID reduction via RL fine-tuning with adversarial reward signals. Plug-and-play for existing models.
Formulated reverse diffusion as MDP with adversarial discriminators, achieving 21.7% FID reduction vs. baseline. Plug-and-play for existing models.