AI RESEARCH
Cluster-Level Attention-Guided Parallel Decoding for Masked Diffusion Language Models
arXiv CS.LG
•
ArXi:2605.29607v1 Announce Type: new Masked diffusion language models (MDLMs) enable parallel decoding by predicting all masked positions at each denoising step, yet existing