AI RESEARCH
Visual Persuasion: What Influences Decisions of Vision-Language Models?
arXiv CS.AI
•
ArXi:2602.15278v2 Announce Type: replace-cross The web is littered with images, once created for human consumption and now increasingly interpreted by agents using vision-language models (VLMs). These agents make visual decisions at scale, deciding what to click, recommend, or buy. Yet, we know little about the structure of their visual preferences. We