AI RESEARCH
Mitigating Adaptive Attacks against Reasoning Models with Activation Consistency Training
arXiv CS.LG
•
ArXi:2605.28467v1 Announce Type: new As LLMs gain stronger reasoning capabilities, their extended chain-of-thought