Inferensys

Glossary

Neural Predicate Invention

Neural predicate invention is a neuro-symbolic AI process where a system automatically discovers and defines new symbolic concepts or relations (predicates) to explain observed data or solve tasks.
Developer building agentic RAG system, retrieval pipeline diagram on laptop, technical workspace with notes.
NEURO-SYMBIC AI

What is Neural Predicate Invention?

Neural predicate invention is a core capability within neuro-symbolic AI systems, enabling the automatic discovery of new symbolic concepts.

Neural predicate invention is the process by which a neuro-symbolic AI system automatically discovers and defines new symbolic concepts or relations—called predicates—that are useful for explaining observed data or solving a task. It bridges inductive logic programming and deep learning, where a neural network proposes candidate predicates from raw data, and a symbolic reasoner evaluates their utility for logical inference or knowledge base completion. This allows the system to extend its own symbolic knowledge representation beyond initially provided primitives.

The process is often framed as a joint optimization problem, where the neural component learns distributed representations that suggest latent regularities, and the symbolic component tests these as hypotheses against logical constraints. Successful invention creates compact, generalizable abstractions, reducing the complexity of learned rules. This is critical for scaling explainable AI to complex domains where relevant concepts are not pre-defined, enabling autonomous knowledge acquisition and more efficient relational reasoning.

NEURO-SYMBIC AI

Key Characteristics of Neural Predicate Invention

Neural predicate invention is the process by which a neuro-symbolic system automatically discovers and defines new symbolic concepts or relations (predicates) that are useful for explaining observed data or solving a task.

01

Automated Concept Discovery

The core function is the unsupervised or weakly-supervised generation of new symbolic predicates. The system analyzes data patterns and proposes new logical concepts (e.g., is_metastable(state) in chemistry or upward_trend(metric, window) in finance) that were not present in the initial knowledge base. This moves beyond predefined ontologies.

  • Process: Often involves searching a space of possible predicate definitions.
  • Goal: To find concepts that compactly explain observations or improve task performance.
02

Differentiable Symbolic Representation

Invented predicates are represented in a form compatible with gradient-based learning. Unlike static symbols in classic AI, these predicates are often embedded as continuous vectors or implemented as neural modules. This allows the system to refine the meaning and utility of a new predicate through backpropagation based on task loss.

  • Key Benefit: The invented concept's definition can be softly adjusted during training for optimal integration with other neural components.
03

Integration with Logical Reasoning

Newly invented predicates are immediately operational within a symbolic reasoning framework. They can be used as atoms in first-order logic rules, incorporated into a knowledge graph, or serve as conditions in a production system. For example, an invented predicate risk_factor(patient, X) can be used in a learned rule: IF risk_factor(patient, high) AND age(patient, >65) THEN recommend(test, Y).

  • This creates a closed loop where reasoning guides invention and new inventions enhance reasoning.
04

Driven by Data Compression & Utility

Invention is not arbitrary; it's guided by objective functions related to explanatory power and task utility. Common drivers include:

  • Minimum Description Length (MDL): Prefers predicates that allow the overall system (data + theory) to be described more succinctly.
  • Improvement in Predictive Accuracy: A new predicate is retained if it improves the model's performance on a target task (e.g., classification, planning success).
  • Increase in Inferential Efficiency: The predicate speeds up logical deduction or query answering.
05

Bridges Perception and Reasoning

This process often acts as the critical link between sub-symbolic perception (e.g., raw images, sensor data, text) and high-level reasoning. A neural network perceives the world and invents symbolic abstractions to describe it. These abstractions are then manipulated by a logical reasoner to make decisions or draw conclusions.

  • Example: A vision system watching blocks-world videos might invent the predicate supports(object_a, object_b) to logically reason about stability, going beyond raw pixel data.
06

Enables Knowledge Base Extension

It provides a mechanism for autonomous knowledge acquisition. Starting with a seed knowledge base, the system can propose new facts and relations using invented predicates, thereby extending its own symbolic knowledge. This is a step towards lifelong learning in neuro-symbolic systems, where the agent's understanding of the world becomes richer and more structured over time without manual engineering.

  • Contrasts with static knowledge bases that require human curation.
NEURO-SYMBIC AI

How Neural Predicate Invention Works

Neural predicate invention is the process by which a neuro-symbolic system automatically discovers and defines new symbolic concepts or relations (predicates) that are useful for explaining observed data or solving a task.

Neural predicate invention is a core capability of neuro-symbolic AI where a system autonomously hypothesizes new symbolic concepts—called predicates—to improve its reasoning. Unlike systems with fixed symbolic vocabularies, it uses neural networks to detect latent patterns in data and proposes new logical terms, such as is_metastable(compound) in chemistry or upstream_supplier(company) in logistics. This process bridges sub-symbolic learning with symbolic abstraction, creating an evolving knowledge representation.

The invention mechanism typically involves a search in the space of possible predicate definitions, guided by a neural network that scores candidates based on their utility for compressing data or solving a task. Techniques like differentiable inductive logic programming (∂ILP) allow gradient-based learning of these invented rules. The system evaluates new predicates by their ability to make the existing symbolic theory more consistent, predictive, or compact, enabling automated knowledge discovery without human pre-definition of all relevant concepts.

NEURAL PREDICATE INVENTION

Frequently Asked Questions

Neural predicate invention is a core capability of neuro-symbolic AI, enabling systems to autonomously discover new concepts. These FAQs address its mechanisms, applications, and distinctions from related techniques.

Neural predicate invention is the process by which a neuro-symbolic AI system automatically discovers and defines new symbolic concepts or relations—called predicates—that are useful for explaining observed data or solving a task. It works by integrating a neural network's ability to detect latent patterns in data with a symbolic system's capacity for logical representation. The neural component identifies recurring structures or clusters in the input, which are then formulated as candidate predicates (e.g., is_metastable(state) or causes_interference(signal_a, signal_b)). These new predicates are evaluated for their utility in improving the system's performance on a target objective, such as prediction accuracy or plan efficiency, and are subsequently integrated into the system's symbolic knowledge base for future reasoning.

Prasad Kumkar

About the author

Prasad Kumkar

CEO & MD, Inference Systems

Prasad Kumkar is the CEO & MD of Inference Systems and writes about AI systems architecture, LLM infrastructure, model serving, evaluation, and production deployment. Over 5+ years, he has worked across computer vision models, L5 autonomous vehicle systems, and LLM research, with a focus on taking complex AI ideas into real-world engineering systems.

His work and writing cover AI systems, large language models, AI agents, multimodal systems, autonomous systems, inference optimization, RAG, evaluation, and production AI engineering.