UniMedVL: Unifying Medical Multimodal Understanding and Generation through Observation-Knowledge-Analysis

ArXi:2510.15710v3 Announce Type: replace Medical workflows routinely combine reading images with producing visual and textual outputs, making both image understanding and generation central to medical AI. Most existing systems, however, address these abilities in isolated models, losing the shared knowledge that a unified architecture could exploit. To bridge this gap, we present UniMedVL, the first unified medical model that seamlessly integrates multimodal understanding and generation capabilities within a single model without switching weights. We achieve this via a tailored progressive.