AI SAFETY & ETHICS

theory uplift differentially benefits safety & is underleveraged

LessWrong AI

We will likely have near-superhuman mathematics AI by Q1 2027. Qualitatively, AI mathematics capabilities are developing significantly faster than automated AI R&D capabilities. Thus, we will likely have a period of time where the rate of our ability to rigorously & usefully verify and understand model behavior and model outputs outpaces the rate of capability development itself.