AI RESEARCH

The Alignment Floor: When Persona Customization Is Safe

arXiv CS.AI

ArXi:2605.27382v1 Announce Type: cross A key promise of pluralistic AI is behavioral adaptation: persona prompts like "be creative" or "be thorough" let systems respect diverse user values and communication styles. But how much customization can a model absorb before its alignment breaks? We present the first controlled study of the alignment-customization tradeoff, testing seven persona conditions across five tasks on two models with different alignment strengths (1,800 runs