AI RESEARCH
Social Caption: Evaluating Social Understanding in Multimodal Models
arXiv CS.LG
•
ArXi:2601.14569v2 Announce Type: replace-cross Social understanding abilities are crucial for multimodal large language models (MLLMs) to interpret human social interactions. We