AI SAFETY & ETHICS
Retrying vs Resampling in AI Control
LessWrong AI
•
We’ve just released a new paper: Retrying vs Resampling in AI Control. We revisit the resampling protocols