DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion ModelsYing Fan,Olivia Watkins,Yuqing Du,Hao Liu,Moonkyung Ryu,Craig Boutilier,Pieter Abbeel,Mohammad Ghavamzadeh,Kangwook Lee,Kimin LeeNeurIPS 2023(2023)引用 186|浏览187关键词Diffusion models,RLHFAI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要