AI RESEARCH

Safety Generalization Under Distribution Shift in Safe Reinforcement Learning: A Diabetes Testbed

arXiv CS.AI

arXiv:2601.21094v2 Announce Type: replace-cross Safe Reinforcement Learning (RL) algorithms are typically evaluated under fixed