AI SAFETY & ETHICS
theory uplift differentially benefits safety & is underleveraged
LessWrong AI
•
We will likely have near-superhuman mathematics AI by Q1 2027. Qualitatively, AI mathematics capabilities are developing significantly faster than automated AI R&D capabilities. Thus, we will likely have a period of time where the rate of our ability to rigorously & usefully verify and understand model behavior and model outputs outpaces the rate of capability development itself.