AI RESEARCH
Consistency Training while Mitigating Obfuscation via Rate Matching
arXiv CS.AI
•
ArXi:2606.02211v1 Announce Type: cross Large language models are often influenced by extraneous input features, such as cues revealing a user's preferred answer. Consistency