AI RESEARCH

The Alignment Curse: Modality Alignment Supercharges Audio Attacks via Text Transfer

arXiv cs.AI

arXiv:2602.02557v2 Announce Type: replace-cross Recent advances in end-to-end trained omni-models have substantially improved audio capabilities by strengthening text-audio modality alignment. However, whether such alignment inadvertently facilitates the transfer of safety vulnerabilities across modalities remains underexplored. This question is critical because text-based jailbreak attacks are considerably more mature than audio-based ones; if they transfer systematically, current audio safety evaluations may underestimate risks originating from the text modality. In this paper, we