AI RESEARCH

Head-Pose-Aware Visual Speech Recognition with FiLM Modulation

arXiv cs.CV

arXiv:2606.00751v1

Visual Speech Recognition (VSR) aims to recognize speech from visual cues such as lip movements, but its performance is fundamentally limited by viseme ambiguity and pose-induced variations that