AI RESEARCH
Detector-Evasive LLM Paraphrasing via Constrained Policy Optimization
arXiv CS.AI
•
ArXi:2606.00392v1 Announce Type: cross AI-text detectors are vulnerable to paraphrasing and detector-guided paraphrasing attacks, but existing detector-evasion methods often lack precise control over semantic preservation. In particular, optimizing directly for detector evasion can degrade fine-grained semantics, whereas scalarized reward designs provide only indirect, weight-sensitive control over the evasion-semantics trade-off.