Causal representation learning is a subfield of machine learning focused on discovering latent, interpretable causal variables and their structural relationships directly from high-dimensional, unstructured observational data, such as images, video, or text. Unlike standard representation learning, which seeks statistically useful features, the goal is to learn a disentangled representation where each dimension corresponds to an underlying causal factor of the data-generating process. This approach is foundational for building AI agents that can reason about interventions, generalize robustly across environments, and answer counterfactual questions.
