Deception detection is the computational task of identifying when an agent is intentionally communicating false information or concealing the truth. It is a specialized subfield of Theory of Mind (ToM) modeling, where an AI system must infer the mental state of another entity—specifically, the intent to deceive. This involves analyzing behavioral cues, logical inconsistencies in narratives, or deviations from established patterns of communication. Effective detection is foundational for robust multi-agent systems, cybersecurity (adversarial mindreading), and applications requiring high-integrity social interaction.
