AI RESEARCH
Safety Generalization Under Distribution Shift in Safe Reinforcement Learning: A Diabetes Testbed
arXiv CS.AI
•
ArXi:2601.21094v2 Announce Type: replace-cross Safe Reinforcement Learning (RL) algorithms are typically evaluated under fixed