Causal Reinforcement Learning (Causal RL) is a framework where an agent learns not just correlations but the causal mechanisms governing its environment. This is achieved by learning or leveraging a Structural Causal Model (SCM) or causal graph. The agent uses this model to reason about interventions (the do-operator) and counterfactuals, allowing it to predict the effects of its actions more accurately and plan over longer horizons. This causal understanding directly targets core RL challenges like sample efficiency, generalization to new situations, and robustness to distribution shifts, as the agent learns invariant relationships rather than spurious correlations.
