Causal fairness is a framework for assessing and ensuring algorithmic fairness using causal models to define and measure discrimination along specific causal pathways, distinguishing between direct, indirect, and spurious effects of sensitive attributes like race or gender. Unlike statistical fairness metrics that rely on correlations, it uses tools like Structural Causal Models (SCMs) and causal graphs to answer counterfactual questions (e.g., 'Would the decision have been different if the individual's protected attribute were changed?'). This allows for precise, legally-grounded definitions of fairness, such as counterfactual fairness, which holds if an outcome is the same in the actual world and a counterfactual world where the protected attribute differs.
