AI SAFETY & ETHICS

Retrying vs Resampling in AI Control

LessWrong AI

We’ve just released a new paper: Retrying vs Resampling in AI Control. We revisit the resampling protocols