AI RESEARCH

Linear-DPO: Linear Direct Preference Optimization for Diffusion and Flow-Matching Generative Models

arXiv cs.LG

arXiv:2605.21123v1 Announce Type: cross Direct Preference Optimization (DPO) is successful for alignment in LLMs but still faces challenges in text-to-image generation.