AI RESEARCH

Cultural Value Alignment Via Latent Activation Steering in Large Language Models

arXiv CS.CL

ArXi:2605.26365v1 Announce Type: new Large Language Models (LLMs) often exhibit homogenized cultural perspectives. While the World Values Survey (WVS) provides a gold standard for mapping human values, traditional direct prompting of LLMs on WVS often fails to access the model's latent cultural depth, leading to safety-aligned refusals or neutral responses. Here, we propose a generalizable framework for cultural evaluation and intervention that transitions from abstract queries to scenario-based behavioral probing.