Kavli Affiliate: Yi Zhou | First 5 Authors: Shaocong Ma | Summary: The goal of robust constrained reinforcement learning (RL) is to optimize an agent’s performance under the worst-case model uncertainty while satisfying safety or resource constraints. In this paper, we demonstrate that strong duality does not generally hold in robust […]
Title: Rectified Robust Policy Optimization for Model-Uncertain Constrained Reinforcement Learning without Strong Duality