AI RESEARCH
Linear-DPO: Linear Direct Preference Optimization for Diffusion and Flow-Matching Generative Models
arXiv CS.LG
•
ArXi:2605.21123v1 Announce Type: cross Direct Preference Optimization (DPO) is successful for alignment in LLMs but still faces challenges in text-to-image generation.