Theory-theory is a cognitive science hypothesis proposing that individuals understand others' mental states by employing an innate or learned folk-psychological theory. This internal 'theory' consists of causal laws linking observable behavior to unobservable mental states like beliefs, desires, and intentions. In AI and agentic systems, implementing theory-theory means endowing an agent with a structured, inferential model to predict and explain the behavior of other agents by attributing such internal states to them, a core capability for multi-agent cooperation and social cognition.
