AI RESEARCH
Head-Pose-Aware Visual Speech Recognition with FiLM Modulation
arXiv CS.CV
•
ArXi:2606.00751v1 Announce Type: new Visual Speech Recognition (VSR) aims to recognize speech from visual cues such as lip movements, but its performance is fundamentally limited by viseme ambiguity and pose-induced variations that