AI RESEARCH
Leveraging Vision-Language Models to Detect Attention in Educational Videos
arXiv CS.CV
•
ArXi:2605.20211v1 Announce Type: new Educational videos are a cornerstone of remote and blended learning. However, learners' fluctuating attention remains a significant barrier to effective information retention. Prior research has attempted to mitigate this by detecting and reacting to attention loss at runtime using eye tracking. Such detection has been based so far on classical machine learning classifiers trained on engineered features, such as summary statistics over learners' fixations and saccades.