TUBE: Tangent Upper Bound on Evidence for Discrete Diffusion Language Models

ArXi:2605.24292v1 Announce Type: new Log-likelihood is a standard metric for evaluating generative models. Unfortunately, in contrast to autoregressive models (ARMs), discrete diffusion models generally do not admit exact computation of this quantity. Existing evaluations, therefore, rely on the evidence lower bound (ELBO), leaving unclear how much higher the true value may be. We address this by