A false belief task is a structured psychological test designed to evaluate whether an individual or artificial intelligence system possesses a Theory of Mind (ToM)—the ability to understand that others can hold beliefs about the world that are different from both reality and the evaluator's own knowledge. The canonical example is the Sally-Anne test, where an observer must predict an agent's action based on the agent's outdated (false) belief about an object's location. Success requires mental state attribution, specifically recognizing that another's actions are guided by their internal beliefs, not objective truth.
Glossary
False Belief Task

What is a False Belief Task?
A standard test used in developmental psychology and artificial intelligence to assess an entity's capacity for Theory of Mind.
In AI and multi-agent systems research, implementing false belief task competency is a benchmark for social cognition. It requires an agent to maintain a recursive model of another agent's knowledge state, which is foundational for cooperative planning, strategic reasoning, and natural communication. This capability is critical for developing agents that can engage in deception detection, manage shared mental models, and perform inverse planning to infer the goals of others, enabling robust interaction in partially observable environments.
Key Variants and Examples in AI
The False Belief Task is a foundational benchmark for assessing an AI system's capacity for Theory of Mind. These cards detail its core computational variants and applications in modern AI research.
The Sally-Anne Test
The classic first-order false belief task used in developmental psychology and adapted for AI. In the standard scenario:
- Sally places a marble in a basket and leaves.
- Anne moves the marble to a box.
- The AI must predict where Sally will look for the marble upon returning.
A correct answer ('the basket') demonstrates the AI attributes to Sally a belief that differs from reality. This tests basic representational Theory of Mind, a prerequisite for cooperative agents that must track user knowledge.
Second-Order False Belief
A more complex variant testing recursive mental state attribution. The AI must understand what one agent thinks about another agent's beliefs.
Example: John sees Mary put chocolate in drawer A. Mary leaves. John moves the chocolate to drawer B. Mary returns, but does not see John. The AI must answer: 'Where does John think Mary will look for the chocolate?'
Correctly answering 'drawer A' requires modeling John's model of Mary's belief. This is critical for AI in negotiation, diplomacy, or multi-layer adversarial scenarios where anticipating others' expectations is key.
The Smarties Task (Unexpected Contents)
Tests understanding that others can hold false beliefs about object properties. A classic setup:
- Show an agent a Smarties tube (or familiar container).
- Reveal it contains pencils, not candy.
- Ask: 'What will a new agent, who hasn't seen inside, think is in the tube?'
Success requires the AI to separate its own updated knowledge from the naive belief of another. This is directly applicable to user modeling in AI assistants, where the system knows a fact (e.g., a flight is cancelled) but must anticipate and address the user's initial, uninformed state.
Computational Frameworks & Inverse Planning
AI systems solve false belief tasks not through intuition, but via formal Bayesian inference and inverse planning. The core approach:
- Model the Agent: Treat the other agent as a rational planner with partial knowledge.
- Infer Belief State: Use Bayesian reasoning to infer the most likely world state the other agent believes, given their perceptual history.
- Predict Action: Simulate the agent's decision-making under that inferred belief.
Frameworks like Bayesian Theory of Mind (BToM) implement this, allowing AI to pass false belief tests by explicitly calculating: P(Belief | Observed Actions, World Dynamics). This provides a transparent, debuggable mechanism for mental state attribution.
Applications in Human-AI Collaboration
False belief reasoning is not an academic exercise; it's operationalized in collaborative AI systems:
- Helpful AI Assistants: An agent that knows you missed an email can infer your false belief about meeting details and proactively correct it.
- Interactive Story Generation: Characters with consistent, evolving beliefs that may be false create coherent, engaging narratives.
- Multi-Agent Coordination: In a team of AI and humans, agents must track who knows what to avoid redundant communication or dangerous misinformation.
- Robotic Instruction: A robot must understand that a human pointing to a tool's usual location holds a false belief if the tool has been moved, and must seek clarification.
Limitations & The 'Folk Psychology' Gap
Current AI performance on false belief tasks reveals significant gaps:
- Brittleness: Models often fail on slight narrative variations, showing a lack of robust, generalized understanding.
- Lack of Deep Causality: Many systems pass via surface-level pattern matching from training data, not genuine causal reasoning about perception and belief.
- The 'Folk Psychology' Problem: Humans use a rich, intuitive framework of mental concepts. AI lacks this integrated model, making its reasoning fragmented and computationally expensive.
True progress requires moving beyond narrow task benchmarks to building integrated cognitive architectures that naturally support mental state reasoning as part of general world understanding.
How is it Implemented in AI Systems?
In AI, the false belief task is operationalized as a benchmark for evaluating a system's capacity for mental state attribution, a core component of Theory of Mind (ToM).
Implementation typically involves presenting a language model or multi-agent system with a narrative where a character holds an outdated or incorrect belief. The system must then answer questions or predict actions based on the character's subjective mental state, not objective reality. This tests the model's ability to maintain and reason over distinct belief partitions, separating its own knowledge from attributed beliefs.
Advanced implementations use recursive modeling frameworks, where an agent's internal world model includes explicit variables representing other agents' beliefs. Evaluation is performed through specialized datasets like ToMi or FANToM, which measure the model's success rate on first-order and higher-order false belief scenarios, providing a quantitative metric for social reasoning capability.
Frequently Asked Questions
A false belief task is a standard test in developmental psychology and AI used to assess whether an entity understands that others can hold beliefs about the world that differ from reality. This FAQ addresses its role in evaluating Theory of Mind in artificial intelligence systems.
A false belief task is an experimental paradigm designed to test an entity's capacity for Theory of Mind (ToM)—specifically, its understanding that another agent can hold a belief about the world that is contradicted by the true state of reality. In AI, passing such a task demonstrates a system's ability to attribute and reason about the divergent mental states of other agents, which is foundational for sophisticated multi-agent cooperation, strategic reasoning, and natural language pragmatics. The classic test involves a scenario where a character (e.g., Sally) places an object in a location (Location A) and leaves. Another character (e.g., Anne) then moves the object to a new location (Location B) unbeknownst to Sally. The critical question is: "Where will Sally look for her object?" An entity with a functional ToM must answer "Location A," correctly inferring Sally's false belief based on her outdated knowledge.
Enabling Efficiency, Speed & Accuracy
Intelligent Analysis, Decision & Execution
We build AI systems for teams that need search across company data, workflow automation across tools, or AI features inside products and internal software.
Talk to Us
Search across company data
Give teams answers from docs, tickets, runbooks, and product data with sources and permissions.
Useful when people spend too long searching or get different answers from different systems.

Automate internal workflows
Use AI to route work, draft outputs, trigger actions, and keep approvals and logs in place.
Useful when repetitive work moves across multiple tools and teams.

Add AI to products and internal tools
Build assistants, guided actions, or decision support into the software your team or customers already use.
Useful when AI needs to be part of the product, not a separate tool.
Related Terms
The False Belief Task is a foundational concept for assessing mental state attribution. These related terms define the broader cognitive and computational frameworks for modeling the beliefs, intentions, and knowledge of other agents.
Theory of Mind (ToM)
Theory of Mind (ToM) is the cognitive capacity to attribute mental states—such as beliefs, desires, intentions, and knowledge—to oneself and others. This ability enables the prediction and explanation of behavior, forming the foundational concept that the False Belief Task is designed to test.
- In AI, implementing ToM means building systems that can maintain and reason about the distinct, potentially inaccurate, internal models of other agents.
- This is critical for cooperative multi-agent systems, human-AI collaboration, and any scenario requiring strategic interaction.
Belief-Desire-Intention (BDI) Model
The Belief-Desire-Intention (BDI) model is a classic software architecture for intelligent agents that structures decision-making around three key components:
- Beliefs: The agent's knowledge about the world (which may be true or false).
- Desires: The agent's goals or objectives.
- Intentions: The plans or courses of action to which the agent has committed.
This model provides a formal framework for implementing agents that can reason about their own and others' false beliefs, as beliefs are explicitly represented and can diverge from ground truth.
Recursive Modeling
Recursive modeling is a computational approach where an agent models not only the state of the world but also the internal models of other agents. These models can be nested to multiple levels (e.g., 'I think that you think that I think...').
- This is essential for passing higher-order False Belief Tasks and for sophisticated strategic reasoning in games like poker or negotiation.
- The computational complexity grows exponentially with the depth of recursion, often requiring approximations in practical AI systems.
Inverse Planning
Inverse planning (or Bayesian inverse reinforcement learning) is a method for inferring an agent's hidden goals and beliefs by reasoning backwards from its observed actions. It assumes the observed agent is approximately rational in planning to achieve its goals.
- This is a core computational technique for intent recognition and plan recognition.
- To infer a false belief, an inverse planner would posit that the observed agent is acting rationally given its inaccurate world model, then deduce what that model must be.
Multi-Agent Epistemic Logic
Multi-agent epistemic logic is a formal logical system used to rigorously reason about the knowledge and beliefs of multiple interacting agents. It includes operators like 'Agent A knows that P' and 'Agent A believes that P,' and can encode higher-order statements.
- This formalism allows precise definition of concepts like common knowledge and mutual belief.
- It provides the mathematical underpinnings for specifying and verifying properties in systems where False Belief Tasks are relevant, ensuring logical consistency in mental state attributions.
Simulation Theory vs. Theory-Theory
These are two leading cognitive science hypotheses for how humans perform mental state attribution, both relevant to AI architecture choices:
- Simulation Theory: Proposes understanding others by mentally simulating their situation using one's own cognitive machinery. In AI, this could involve using the agent's own world model but with different initial beliefs or goals.
- Theory-Theory: Proposes using an innate or learned 'folk psychology' theory to make inferences about internal states. In AI, this aligns with learning a separate predictive model of other agents' behavior.
The design of an AI's mindreading capability often reflects a blend of these approaches.

About the author
Prasad Kumkar
CEO & MD, Inference Systems
Prasad Kumkar is the CEO & MD of Inference Systems and writes about AI systems architecture, LLM infrastructure, model serving, evaluation, and production deployment. Over 5+ years, he has worked across computer vision models, L5 autonomous vehicle systems, and LLM research, with a focus on taking complex AI ideas into real-world engineering systems.
His work and writing cover AI systems, large language models, AI agents, multimodal systems, autonomous systems, inference optimization, RAG, evaluation, and production AI engineering.
Partnered with leading AI, data, and software stack.
How We Work
Custom AI workflows for your Business
One-fit-all AI don't work for modern businesses. At Inferensys, we aim to understand your business & custom requirements; which we use to define most efficient agentic workflows, the data, and the tools for your business.
01
Review the use case
We understand the task, the users, and where AI can actually help.
Read more02
Pick the right approach
We define what needs search, automation, or product integration.
Read more03
Build the first useful version
We implement the part that proves the value first.
Read more04
Improve from there
We add the checks and visibility needed to keep it useful.
Read moreThe first call is a practical review of your use case and the right next step.
Talk to Us