AI RESEARCH
Do Audio LLMs Listen or Read? Analyzing and Mitigating Paralinguistic Failures with VoxParadox
arXiv CS.LG
arXiv:2605.27772v1 Announce Type: cross Audio large language models (Audio LLMs) demonstrate strong performance on speech understanding tasks, yet their ability to understand paralinguistic information remains limited. To systematically quantify this issue, we