Safe Reinforcement Learning (Safe RL) is a subfield of reinforcement learning dedicated to ensuring that an autonomous agent's learning process and resulting policy satisfy defined safety constraints. Unlike standard RL, which focuses solely on maximizing cumulative reward, Safe RL explicitly incorporates mechanisms to avoid catastrophic states or actions, such as physical damage to a robot or unsafe interactions in a shared human environment. This is formalized through constrained Markov Decision Processes (CMDPs), which augment the standard MDP framework with cost functions that must remain below specified thresholds.




