CARE-RL: Capability-Aware Reinforcement Learning for Mitigating Cross-Domain Conflicts

ArXi:2606.00609v1 Announce Type: cross Reinforcement learning (RL) with verifiable rewards has achieved strong progress in reasoning-oriented LLMs, but extending it to multi-domain RL remains challenging due to reward unreliability in non-verifiable tasks and capability interference across domains. We propose CARE-RL to combine protocol-aware reward generation with capability-aware optimization for mitigating cross-domain conflicts.