AI RESEARCH

Mitigating Adaptive Attacks against Reasoning Models with Activation Consistency Training

arXiv CS.LG

ArXi:2605.28467v1 Announce Type: new As LLMs gain stronger reasoning capabilities, their extended chain-of-thought