AI RESEARCH

Social Caption: Evaluating Social Understanding in Multimodal Models

arXiv CS.LG

arXiv:2601.14569v2 Announce Type: replace-cross Social understanding abilities are crucial for multimodal large language models (MLLMs) to interpret human social interactions. We