Glossary

First-Order Theory of Mind

First-order Theory of Mind is the cognitive capacity to attribute mental states—such as beliefs, desires, and intentions—to other agents, enabling the prediction and explanation of their behavior.

Get in touch Learn more

Developer reviewing multi-agent chat interface on laptop, agent conversation logs visible, casual coding session at WeWork desk.

AGENTIC COGNITIVE ARCHITECTURES

What is First-Order Theory of Mind?

First-order Theory of Mind is a foundational capability in artificial intelligence and cognitive science, enabling an agent to model the basic mental states of others.

First-order Theory of Mind (ToM) is the cognitive capacity to attribute basic mental states—such as beliefs, desires, intentions, and knowledge—to another agent. This allows a system to model that another entity may have a perspective, goal, or understanding of the world that differs from its own or from objective reality. In AI, this is a critical component for enabling cooperative behavior, intent recognition, and effective communication in multi-agent systems.

This capability is distinct from higher-order Theory of Mind, which involves recursive reasoning about mental states (e.g., "I think that you think..."). First-order ToM is often evaluated using tasks like the false belief task, where an agent must predict an action based on another's incorrect belief. Implementing it computationally involves techniques like inverse planning and recursive modeling, forming the basis for more sophisticated social cognition and strategic reasoning in autonomous agents.

THEORY OF MIND MODELING

Core Characteristics of First-Order Theory of Mind

First-order Theory of Mind is the foundational cognitive capacity enabling an agent to attribute basic mental states—like beliefs, knowledge, and desires—to another agent. These are its defining operational characteristics.

Single-Mind Attribution

First-order Theory of Mind involves attributing a mental state to one other agent. The attributing agent forms a model that another agent holds a specific belief, desire, or intention. This is distinct from higher-order Theory of Mind, which involves reasoning about mental states about mental states (e.g., 'Alice believes that Bob believes X').

Key Limitation: It does not support recursive modeling.
Example: A self-driving car predicting that a pedestrian believes the crosswalk signal is green, and will therefore step into the road.

Representation of False Belief

A critical test for first-order ToM is understanding that another agent can hold a belief that contradicts reality (a false belief). The system must maintain two conflicting representations:

The true state of the world.
The other agent's (potentially incorrect) belief state about the world.

This capability is essential for deception, cooperation with misinformed partners, and explaining mistaken actions. The classic Sally-Anne test from developmental psychology is the benchmark for this capacity.

Goal-Directed Action Prediction

Agents with first-order ToM do not just predict actions based on environmental stimuli; they predict actions based on inferred internal goals. The reasoning chain is:

Observe Agent B's actions and context.
Infer Agent B's likely goal or desire.
Infer Agent B's belief about how to achieve that goal.
Predict Agent B's future actions based on that belief-desire pair.

This moves prediction from simple behavioral pattern matching to model-based intention recognition.

Separation of Self and Other Models

The system must maintain a clear ontological distinction between its own knowledge/beliefs and its model of the other agent's knowledge/beliefs. This requires:

Egocentric Model: The agent's own, presumably accurate, view of the world.
Allocentric Model: A separate, attributed view representing the other agent's perspective.

Failure of this separation leads to egocentric bias, where the agent incorrectly assumes others know what it knows, a common failure mode in early AI systems and young children.

Foundational for Communication

First-order ToM is a prerequisite for pragmatic language understanding. To interpret an utterance, an agent must model the speaker's:

Informational Goal: What they are trying to convey (communicative intent).
Knowledge State: What they believe the listener already knows (modeled via ToM).

This enables understanding of non-literal speech, sarcasm, and indirect requests. It underpins the Gricean Maxims of cooperative conversation, as effective communication requires estimating the listener's mental state.

Implementation via Inverse Planning

A dominant computational method for achieving first-order ToM is inverse planning (or Bayesian inverse reinforcement learning). The agent solves:

'Given that I observed these actions from Agent B, what goals and beliefs would make these actions rational?'

Process: The observer assumes the other agent is a bounded rational planner.
Mechanism: It uses a generative model of planning (a 'forward' model) and runs it in reverse via probabilistic inference.
Output: A posterior distribution over the other agent's possible goals and world beliefs.

COMPARATIVE ANALYSIS

First-Order vs. Higher-Order Theory of Mind

A comparison of the computational and representational characteristics distinguishing first-order mental state attribution from its higher-order, recursive counterparts.

Cognitive Feature	First-Order Theory of Mind	Second-Order Theory of Mind	Nth-Order Theory of Mind (n>2)
Representational Depth	Attributes simple mental states (e.g., 'Bob believes X').	Attributes mental states about mental states (e.g., 'Alice believes that Bob believes X').	Attributes recursively nested mental states to depth n (e.g., 'I think that you think that I think...').
Formal Notation (Epistemic Logic)	Bₐ p (Agent a believes proposition p).	Bₐ B_b p (Agent a believes that agent b believes p).	Bₐ B_b Bₐ ... p (Nested belief operators).
Primary Computational Use Case	Basic intent recognition, plan prediction, and cooperative task alignment in simple multi-agent systems.	Strategic reasoning in adversarial games, understanding deception, and managing simple mutual beliefs.	Modeling complex social hierarchies, negotiating common knowledge, and advanced coalition formation.
Inference Complexity	Linear in the number of agents; requires a single inference step about another's internal state.	Quadratic; requires modeling another agent's inference process.	Exponential; recursive nesting dramatically increases the state space for plausible mental models.
Example Test (Psychology/AI)	Passing a standard Sally-Anne false belief task.	Understanding stories or playing games requiring reasoning about another's beliefs about your beliefs.	Solving the 'matching pennies' game or understanding layered bluffing in poker beyond the second round.
Typical Implementation in AI Agents	Bayesian inverse planning or fine-tuned language model classifiers for single-agent intent.	Recursive modeling frameworks or nested simulation within a bounded depth.	Limited-depth search with pruning; full Nth-order reasoning is computationally intractable for large n.
Relation to Common Knowledge	Insufficient. Establishes only that agents have individual beliefs.	Insufficient. Establishes mutual belief (everyone believes that everyone believes).	Theoretically required. Common knowledge is an infinite recursion of mutual beliefs (everyone knows that everyone knows...).
Impact on Agent Communication	Enables basic informative communication and clarification of false beliefs.	Enables sophisticated communication acts like irony, bluffing, and understanding communicative intent.	Enables the establishment of conventions and norms that rely on everyone knowing that everyone knows the rules.

FIRST-ORDER THEORY OF MIND

Frequently Asked Questions

First-order Theory of Mind (ToM) is a foundational capability for artificial intelligence, enabling systems to model the beliefs, knowledge, and intentions of other agents. This FAQ addresses its core mechanisms, technical implementation, and role in multi-agent and cooperative AI systems.

First-order Theory of Mind (ToM) is the computational ability of an artificial intelligence agent to attribute a single layer of mental states—such as beliefs, desires, intentions, or knowledge—to another agent. It allows an AI to model that another entity has an internal perspective that may differ from objective reality or its own perspective. For example, an agent with first-order ToM can understand that "Alice believes the package is in Room A," even if the agent itself knows the package has been moved to Room B. This capability is distinct from zero-order reasoning (which lacks any mental state modeling) and second-order or higher-order ToM (which involves recursive modeling, e.g., "Alice believes that Bob believes X"). In AI, this is typically implemented as a form of belief attribution within an agent's internal world model, often using Bayesian inference or learned representations to predict another agent's likely knowledge given their perceptual history and actions.

Enabling Efficiency, Speed & Accuracy

Intelligent Analysis, Decision & Execution

We build AI systems for teams that need search across company data, workflow automation across tools, or AI features inside products and internal software.

Talk to Us

Search across company data

Give teams answers from docs, tickets, runbooks, and product data with sources and permissions.

Useful when people spend too long searching or get different answers from different systems.

Enterprise searchRAGPermissions

Automate internal workflows

Use AI to route work, draft outputs, trigger actions, and keep approvals and logs in place.

Useful when repetitive work moves across multiple tools and teams.

AI agentsWorkflow automationGovernance

Add AI to products and internal tools

Build assistants, guided actions, or decision support into the software your team or customers already use.

Useful when AI needs to be part of the product, not a separate tool.

AI integrationDecision supportModel routing

THEORY OF MIND MODELING

Related Terms

First-order Theory of Mind is a foundational capability within a broader landscape of cognitive and social reasoning. These related concepts define the mechanisms, tests, and higher-order extensions of mental state attribution.

Theory of Mind (ToM)

Theory of Mind (ToM) is the overarching cognitive capacity to attribute mental states—such as beliefs, desires, intentions, and knowledge—to oneself and others. This ability enables the prediction and explanation of behavior. First-order ToM is its most basic, essential level.

Core Function: Forms the basis for social intelligence and cooperative interaction.
Computational Goal: To build agents that can engage in meaningful collaboration by understanding not just what others do, but why they do it.

False Belief Task

A false belief task is the standard empirical test, from developmental psychology, used to assess first-order Theory of Mind. It evaluates whether an entity understands that others can hold beliefs that contradict reality.

Classic Example (Sally-Anne): Sally places a marble in a basket and leaves. Anne moves the marble to a box. The test question is: Where will Sally look for her marble? A correct answer ("the basket") demonstrates an understanding of Sally's false belief.
AI Benchmark: This task is used to evaluate the social reasoning capabilities of artificial agents and language models.

Second-Order Theory of Mind

Second-order Theory of Mind is the recursive ability to attribute mental states about mental states. It involves reasoning like, "Alice believes that Bob believes X." This is critical for deception, bluffing, and complex coordination.

Key Difference: First-order: "I think you think X." Second-order: "I think you think I think X."
Strategic Necessity: Essential for agents operating in competitive multi-agent environments like trading, negotiation, or adversarial games where predicting an opponent's prediction of your move is vital.

Mental State Attribution

Mental state attribution is the active computational process of ascribing specific internal cognitive or emotional states to another agent. First-order ToM is its primary manifestation.

Process Flow: Observe behavior/communication → Infer underlying beliefs, knowledge, goals → Predict future actions.
Technical Implementation: Often involves Bayesian inverse planning or fine-tuned language models that map observable cues to a distribution over possible hidden mental states.

Belief-Desire-Intention (BDI) Model

The Belief-Desire-Intention (BDI) model is a foundational software architecture for intelligent agents that operationalizes Theory of Mind concepts for decision-making.

Beliefs: The agent's knowledge about the world (including beliefs about others' beliefs).
Desires: The agent's goals or motivational state.
Intentions: Committed plans of action chosen to achieve desires.
Architectural Link: A first-order ToM-capable agent uses its BDI architecture to model the BDI states of other agents, creating nested agent models for prediction.

Inverse Planning

Inverse planning is a primary Bayesian computational method for implementing mental state attribution. It works backwards from observed actions to infer the hidden goals and beliefs that likely caused them, assuming the observed agent is approximately rational.

Mechanism: Given a sequence of actions and a model of how a rational planner would act, it calculates the posterior probability over possible goals and world beliefs.
Application: Core to plan recognition and intent inference in collaborative and assistive AI systems, allowing an agent to discern a human user's objective without explicit instruction.

About the author

Prasad Kumkar

CEO & MD, Inference Systems

Prasad Kumkar is the CEO & MD of Inference Systems and writes about AI systems architecture, LLM infrastructure, model serving, evaluation, and production deployment. Over 5+ years, he has worked across computer vision models, L5 autonomous vehicle systems, and LLM research, with a focus on taking complex AI ideas into real-world engineering systems.

His work and writing cover AI systems, large language models, AI agents, multimodal systems, autonomous systems, inference optimization, RAG, evaluation, and production AI engineering.

Limited slotsGet a Free AI Consultation

How We Work

Custom AI workflows for your Business

One-fit-all AI don't work for modern businesses. At Inferensys, we aim to understand your business & custom requirements; which we use to define most efficient agentic workflows, the data, and the tools for your business.