First-order Theory of Mind (ToM) is the cognitive capacity to attribute basic mental states—such as beliefs, desires, intentions, and knowledge—to another agent. This allows a system to model that another entity may have a perspective, goal, or understanding of the world that differs from its own or from objective reality. In AI, this is a critical component for enabling cooperative behavior, intent recognition, and effective communication in multi-agent systems.
Glossary
First-Order Theory of Mind

What is First-Order Theory of Mind?
First-order Theory of Mind is a foundational capability in artificial intelligence and cognitive science, enabling an agent to model the basic mental states of others.
This capability is distinct from higher-order Theory of Mind, which involves recursive reasoning about mental states (e.g., "I think that you think..."). First-order ToM is often evaluated using tasks like the false belief task, where an agent must predict an action based on another's incorrect belief. Implementing it computationally involves techniques like inverse planning and recursive modeling, forming the basis for more sophisticated social cognition and strategic reasoning in autonomous agents.
Core Characteristics of First-Order Theory of Mind
First-order Theory of Mind is the foundational cognitive capacity enabling an agent to attribute basic mental states—like beliefs, knowledge, and desires—to another agent. These are its defining operational characteristics.
Single-Mind Attribution
First-order Theory of Mind involves attributing a mental state to one other agent. The attributing agent forms a model that another agent holds a specific belief, desire, or intention. This is distinct from higher-order Theory of Mind, which involves reasoning about mental states about mental states (e.g., 'Alice believes that Bob believes X').
- Key Limitation: It does not support recursive modeling.
- Example: A self-driving car predicting that a pedestrian believes the crosswalk signal is green, and will therefore step into the road.
Representation of False Belief
A critical test for first-order ToM is understanding that another agent can hold a belief that contradicts reality (a false belief). The system must maintain two conflicting representations:
- The true state of the world.
- The other agent's (potentially incorrect) belief state about the world.
This capability is essential for deception, cooperation with misinformed partners, and explaining mistaken actions. The classic Sally-Anne test from developmental psychology is the benchmark for this capacity.
Goal-Directed Action Prediction
Agents with first-order ToM do not just predict actions based on environmental stimuli; they predict actions based on inferred internal goals. The reasoning chain is:
- Observe Agent B's actions and context.
- Infer Agent B's likely goal or desire.
- Infer Agent B's belief about how to achieve that goal.
- Predict Agent B's future actions based on that belief-desire pair.
This moves prediction from simple behavioral pattern matching to model-based intention recognition.
Separation of Self and Other Models
The system must maintain a clear ontological distinction between its own knowledge/beliefs and its model of the other agent's knowledge/beliefs. This requires:
- Egocentric Model: The agent's own, presumably accurate, view of the world.
- Allocentric Model: A separate, attributed view representing the other agent's perspective.
Failure of this separation leads to egocentric bias, where the agent incorrectly assumes others know what it knows, a common failure mode in early AI systems and young children.
Foundational for Communication
First-order ToM is a prerequisite for pragmatic language understanding. To interpret an utterance, an agent must model the speaker's:
- Informational Goal: What they are trying to convey (communicative intent).
- Knowledge State: What they believe the listener already knows (modeled via ToM).
This enables understanding of non-literal speech, sarcasm, and indirect requests. It underpins the Gricean Maxims of cooperative conversation, as effective communication requires estimating the listener's mental state.
Implementation via Inverse Planning
A dominant computational method for achieving first-order ToM is inverse planning (or Bayesian inverse reinforcement learning). The agent solves:
'Given that I observed these actions from Agent B, what goals and beliefs would make these actions rational?'
- Process: The observer assumes the other agent is a bounded rational planner.
- Mechanism: It uses a generative model of planning (a 'forward' model) and runs it in reverse via probabilistic inference.
- Output: A posterior distribution over the other agent's possible goals and world beliefs.
First-Order vs. Higher-Order Theory of Mind
A comparison of the computational and representational characteristics distinguishing first-order mental state attribution from its higher-order, recursive counterparts.
| Cognitive Feature | First-Order Theory of Mind | Second-Order Theory of Mind | Nth-Order Theory of Mind (n>2) |
|---|---|---|---|
Representational Depth | Attributes simple mental states (e.g., 'Bob believes X'). | Attributes mental states about mental states (e.g., 'Alice believes that Bob believes X'). | Attributes recursively nested mental states to depth n (e.g., 'I think that you think that I think...'). |
Formal Notation (Epistemic Logic) | Bₐ p (Agent a believes proposition p). | Bₐ B_b p (Agent a believes that agent b believes p). | Bₐ B_b Bₐ ... p (Nested belief operators). |
Primary Computational Use Case | Basic intent recognition, plan prediction, and cooperative task alignment in simple multi-agent systems. | Strategic reasoning in adversarial games, understanding deception, and managing simple mutual beliefs. | Modeling complex social hierarchies, negotiating common knowledge, and advanced coalition formation. |
Inference Complexity | Linear in the number of agents; requires a single inference step about another's internal state. | Quadratic; requires modeling another agent's inference process. | Exponential; recursive nesting dramatically increases the state space for plausible mental models. |
Example Test (Psychology/AI) | Passing a standard Sally-Anne false belief task. | Understanding stories or playing games requiring reasoning about another's beliefs about your beliefs. | Solving the 'matching pennies' game or understanding layered bluffing in poker beyond the second round. |
Typical Implementation in AI Agents | Bayesian inverse planning or fine-tuned language model classifiers for single-agent intent. | Recursive modeling frameworks or nested simulation within a bounded depth. | Limited-depth search with pruning; full Nth-order reasoning is computationally intractable for large n. |
Relation to Common Knowledge | Insufficient. Establishes only that agents have individual beliefs. | Insufficient. Establishes mutual belief (everyone believes that everyone believes). | Theoretically required. Common knowledge is an infinite recursion of mutual beliefs (everyone knows that everyone knows...). |
Impact on Agent Communication | Enables basic informative communication and clarification of false beliefs. | Enables sophisticated communication acts like irony, bluffing, and understanding communicative intent. | Enables the establishment of conventions and norms that rely on everyone knowing that everyone knows the rules. |
Frequently Asked Questions
First-order Theory of Mind (ToM) is a foundational capability for artificial intelligence, enabling systems to model the beliefs, knowledge, and intentions of other agents. This FAQ addresses its core mechanisms, technical implementation, and role in multi-agent and cooperative AI systems.
First-order Theory of Mind (ToM) is the computational ability of an artificial intelligence agent to attribute a single layer of mental states—such as beliefs, desires, intentions, or knowledge—to another agent. It allows an AI to model that another entity has an internal perspective that may differ from objective reality or its own perspective. For example, an agent with first-order ToM can understand that "Alice believes the package is in Room A," even if the agent itself knows the package has been moved to Room B. This capability is distinct from zero-order reasoning (which lacks any mental state modeling) and second-order or higher-order ToM (which involves recursive modeling, e.g., "Alice believes that Bob believes X"). In AI, this is typically implemented as a form of belief attribution within an agent's internal world model, often using Bayesian inference or learned representations to predict another agent's likely knowledge given their perceptual history and actions.
Enabling Efficiency, Speed & Accuracy
Intelligent Analysis, Decision & Execution
We build AI systems for teams that need search across company data, workflow automation across tools, or AI features inside products and internal software.
Talk to Us
Search across company data
Give teams answers from docs, tickets, runbooks, and product data with sources and permissions.
Useful when people spend too long searching or get different answers from different systems.

Automate internal workflows
Use AI to route work, draft outputs, trigger actions, and keep approvals and logs in place.
Useful when repetitive work moves across multiple tools and teams.

Add AI to products and internal tools
Build assistants, guided actions, or decision support into the software your team or customers already use.
Useful when AI needs to be part of the product, not a separate tool.
Related Terms
First-order Theory of Mind is a foundational capability within a broader landscape of cognitive and social reasoning. These related concepts define the mechanisms, tests, and higher-order extensions of mental state attribution.
Theory of Mind (ToM)
Theory of Mind (ToM) is the overarching cognitive capacity to attribute mental states—such as beliefs, desires, intentions, and knowledge—to oneself and others. This ability enables the prediction and explanation of behavior. First-order ToM is its most basic, essential level.
- Core Function: Forms the basis for social intelligence and cooperative interaction.
- Computational Goal: To build agents that can engage in meaningful collaboration by understanding not just what others do, but why they do it.
False Belief Task
A false belief task is the standard empirical test, from developmental psychology, used to assess first-order Theory of Mind. It evaluates whether an entity understands that others can hold beliefs that contradict reality.
- Classic Example (Sally-Anne): Sally places a marble in a basket and leaves. Anne moves the marble to a box. The test question is: Where will Sally look for her marble? A correct answer ("the basket") demonstrates an understanding of Sally's false belief.
- AI Benchmark: This task is used to evaluate the social reasoning capabilities of artificial agents and language models.
Second-Order Theory of Mind
Second-order Theory of Mind is the recursive ability to attribute mental states about mental states. It involves reasoning like, "Alice believes that Bob believes X." This is critical for deception, bluffing, and complex coordination.
- Key Difference: First-order: "I think you think X." Second-order: "I think you think I think X."
- Strategic Necessity: Essential for agents operating in competitive multi-agent environments like trading, negotiation, or adversarial games where predicting an opponent's prediction of your move is vital.
Mental State Attribution
Mental state attribution is the active computational process of ascribing specific internal cognitive or emotional states to another agent. First-order ToM is its primary manifestation.
- Process Flow: Observe behavior/communication → Infer underlying beliefs, knowledge, goals → Predict future actions.
- Technical Implementation: Often involves Bayesian inverse planning or fine-tuned language models that map observable cues to a distribution over possible hidden mental states.
Belief-Desire-Intention (BDI) Model
The Belief-Desire-Intention (BDI) model is a foundational software architecture for intelligent agents that operationalizes Theory of Mind concepts for decision-making.
- Beliefs: The agent's knowledge about the world (including beliefs about others' beliefs).
- Desires: The agent's goals or motivational state.
- Intentions: Committed plans of action chosen to achieve desires.
- Architectural Link: A first-order ToM-capable agent uses its BDI architecture to model the BDI states of other agents, creating nested agent models for prediction.
Inverse Planning
Inverse planning is a primary Bayesian computational method for implementing mental state attribution. It works backwards from observed actions to infer the hidden goals and beliefs that likely caused them, assuming the observed agent is approximately rational.
- Mechanism: Given a sequence of actions and a model of how a rational planner would act, it calculates the posterior probability over possible goals and world beliefs.
- Application: Core to plan recognition and intent inference in collaborative and assistive AI systems, allowing an agent to discern a human user's objective without explicit instruction.

About the author
Prasad Kumkar
CEO & MD, Inference Systems
Prasad Kumkar is the CEO & MD of Inference Systems and writes about AI systems architecture, LLM infrastructure, model serving, evaluation, and production deployment. Over 5+ years, he has worked across computer vision models, L5 autonomous vehicle systems, and LLM research, with a focus on taking complex AI ideas into real-world engineering systems.
His work and writing cover AI systems, large language models, AI agents, multimodal systems, autonomous systems, inference optimization, RAG, evaluation, and production AI engineering.
Partnered with leading AI, data, and software stack.
How We Work
Custom AI workflows for your Business
One-fit-all AI don't work for modern businesses. At Inferensys, we aim to understand your business & custom requirements; which we use to define most efficient agentic workflows, the data, and the tools for your business.
01
Review the use case
We understand the task, the users, and where AI can actually help.
Read more02
Pick the right approach
We define what needs search, automation, or product integration.
Read more03
Build the first useful version
We implement the part that proves the value first.
Read more04
Improve from there
We add the checks and visibility needed to keep it useful.
Read moreThe first call is a practical review of your use case and the right next step.
Talk to Us