Inferensys

Glossary

Digital Twin Definition Language (DTDL)

The Digital Twin Definition Language (DTDL) is an open modeling language, developed by Microsoft, used to define the capabilities, components, relationships, and telemetry interfaces of digital twins to ensure interoperability.
MLOps engineer reviewing model serving infrastructure on laptop, container orchestration visible, technical workspace.
GLOSSARY

What is Digital Twin Definition Language (DTDL)?

A precise definition of the open modeling language for digital twins.

The Digital Twin Definition Language (DTDL) is an open, JSON-LD-based modeling language, developed by Microsoft, used to define the components, properties, telemetry, commands, and relationships of a digital twin in a standardized, machine-readable format. Its primary purpose is to ensure semantic interoperability between different digital twin platforms and applications by providing a common schema for describing a twin's capabilities and its connections within a twin graph. DTDL models are the foundational blueprints for creating and interacting with digital twins in frameworks like Azure Digital Twins.

DTDL enables the creation of complex, hierarchical models where a digital twin can be composed of other twins or components, each with its own defined interface. By standardizing the definition of telemetry (data streams), properties (read-write or read-only state), and commands (methods to invoke), DTDL allows developers and systems to understand and interact with any compliant digital twin predictably. This formal specification is critical for building scalable digital twin ecosystems and is a key enabler for bidirectional data flow and advanced analytics.

CORE LANGUAGE SPECIFICATION

Key Features of DTDL

The Digital Twin Definition Language (DTDL) is an open modeling language, developed by Microsoft, used to define the capabilities, components, relationships, and telemetry interfaces of digital twins to ensure interoperability. Its key features provide the semantic foundation for building connected, intelligent virtual replicas.

01

Semantic Modeling with Interfaces

At its core, DTDL uses JSON-LD-based interfaces as the fundamental building blocks to define a twin's capabilities. Each interface is a reusable schema that declares:

  • Properties: The read-write or read-only state of the twin (e.g., currentTemperature).
  • Telemetry: Time-series data streams the twin emits (e.g., vibrationSensorReadings).
  • Commands: Actions that can be invoked on the twin (e.g., reboot).
  • Relationships: Links to other digital twins, defining a graph network. This structured approach ensures every piece of data has a well-defined, machine-readable meaning.
02

Inheritance and Component Composition

DTDL supports interface inheritance, allowing complex models to extend simpler ones, promoting reuse. More critically, it enables component composition via the component schema. A complex asset, like an industrial pump, can be modeled as a top-level twin composed of component twins for its motor, bearing, and controller. Each component is defined by its own interface, enabling modular design, independent lifecycle management, and granular telemetry analysis. This mirrors real-world system hierarchies.

03

Relationship Graph Definition

Beyond properties and telemetry, DTDL explicitly models relationships between twins. Relationships are first-class citizens with:

  • A name (e.g., contains, fedBy, installedIn).
  • A target (the twin being related to).
  • Their own properties. This allows the construction of a twin graph—a network of interconnected digital twins that represents a physical system's topology. This graph is essential for spatial queries ("find all valves in room B12") and understanding system-wide cause-and-effect.
05

Integration with IoT and Simulation

DTDL is designed to bridge the physical and virtual worlds. It integrates seamlessly with:

  • IoT Protocols: Telemetry defined in DTDL maps directly to data ingested via MQTT or OPC UA, often using a Unified Namespace (UNS) architecture for contextualization.
  • Simulation Models: A DTDL interface can describe the inputs, outputs, and parameters of a physics-based model or surrogate model, enabling it to be invoked for what-if analysis or predictive maintenance within the twin.
  • Control Systems: Commands defined in DTDL can trigger actions in Programmable Logic Controllers (PLCs) or software, enabling bidirectional data flow.
06

Foundation for Advanced Capabilities

A well-defined DTDL model is the prerequisite for advanced digital twin functionalities:

  • Cognitive Twins: The structured data provides clean, labeled inputs for machine learning models that add reasoning and optimization.
  • Twin Graph Analytics: Relationships enable graph-based queries and algorithms for root-cause analysis.
  • Lifecycle Management: The model serves as a single source of truth from design (virtual commissioning) through operation to decommissioning, supporting the digital thread.
  • Edge Deployment: Models can be compiled into lightweight runtimes for edge twins that operate with low latency.
CORE MECHANISM

How DTDL Works

The Digital Twin Definition Language (DTDL) is the open modeling language that defines the structure, capabilities, and interactions of a digital twin.

Digital Twin Definition Language (DTDL) is an open modeling language, based on JSON-LD, used to define the semantic model of a digital twin. A DTDL model describes a twin's components, properties, telemetry streams, commands, and relationships to other twins using a standardized vocabulary. This model acts as a contract, ensuring that any system or application that understands DTDL can interpret the twin's data and capabilities correctly, enabling semantic interoperability across different platforms and vendors.

DTDL works by providing a schema that digital twin platforms, like Microsoft Azure Digital Twins, use to instantiate and manage twin instances. When telemetry data arrives, it is contextualized against the DTDL model, transforming raw sensor readings into meaningful, property-based updates. The language also supports inheritance and complex schemas, allowing for the creation of reusable interfaces and the modeling of intricate systems. This structured definition is foundational for building twin graphs and executing what-if analysis.

APPLICATION DOMAINS

Where is DTDL Used?

The Digital Twin Definition Language (DTDL) provides a standardized foundation for building interoperable digital twins. Its primary application is in modeling complex industrial and business systems for simulation, monitoring, and autonomous control.

01

Industrial IoT & Smart Manufacturing

DTDL is foundational in Industry 4.0 for modeling production lines, robots, and CNC machines. It defines:

  • Telemetry interfaces for sensor data (temperature, vibration).
  • Properties for machine state (operational, idle, faulted).
  • Relationships between machines, conveyors, and work cells. This enables predictive maintenance and virtual commissioning by creating a semantic model of the entire factory floor that integrates with platforms like Azure Digital Twins.
02

Smart Buildings & Facilities Management

Used to create comprehensive models of building systems for energy optimization and occupant comfort. DTDL models define:

  • Components like HVAC units, lighting panels, and occupancy sensors.
  • Commands for adjusting thermostat setpoints or dimming lights.
  • Relationships mapping sensors to controlled zones. This allows facility managers to run what-if analysis on energy usage and enables autonomous systems to maintain environmental conditions while minimizing power consumption.
03

Energy Grids & Utility Networks

Critical for modeling distributed energy resources (DERs) like wind turbines, solar inverters, and substations. DTDL enables:

  • Representation of hierarchical relationships across the grid (transmission → distribution → consumer).
  • Definition of complex telemetry for power flow, voltage, and frequency.
  • Creation of federated twin architectures where different grid segments have their own twin instances. These models support smart grid energy optimization and stability analysis by providing a unified semantic layer for grid assets.
04

Supply Chain & Logistics

Applied to model the end-to-end flow of goods, vehicles, and warehouses. DTDL defines:

  • Assets like shipping containers, autonomous mobile robots (AMRs), and forklifts.
  • Geospatial properties for real-time location tracking.
  • Event-driven commands for rerouting shipments or adjusting inventory levels. This supports autonomous supply chain intelligence, allowing for dynamic exception handling and optimization of logistics networks based on live twin data.
05

Connected Vehicles & Transportation

Used to create digital twins of vehicles, fleets, and traffic infrastructure. DTDL models capture:

  • Vehicle components such as the powertrain, battery pack, and ADAS sensors.
  • Telemetry streams for GPS location, speed, battery state of charge, and diagnostic codes.
  • Relationships between vehicles, charging stations, and traffic management systems. This enables heterogeneous fleet orchestration, predictive maintenance for fleets, and simulation of traffic flow scenarios.
06

Healthcare & Medical Device Integration

Emerging use case for modeling hospital wards, medical equipment, and patient flow. DTDL can define:

  • Device twins for MRI machines, infusion pumps, and patient monitors with their operational states and alerts.
  • Spatial relationships between beds, devices, and nursing stations.
  • Privacy-aware properties that comply with regulations like HIPAA. This facilitates clinical workflow automation, asset tracking, and simulation of patient throughput to optimize resource allocation.
FEATURE COMPARISON

DTDL vs. Other Modeling Standards

A comparison of the Digital Twin Definition Language (DTDL) with other prominent standards for modeling physical assets and systems, focusing on core architectural features.

FeatureDigital Twin Definition Language (DTDL)Asset Administration Shell (AAS)OPC UA Information Modeling

Primary Developer / Standard Body

Microsoft (Open Source)

Industrial Digital Twin Association (IDTA) / Plattform Industrie 4.0

OPC Foundation

Core Modeling Paradigm

JSON-LD-based language for defining twin capabilities, telemetry, and relationships

XML-based submodel templates defining asset properties and operations

Object-oriented node-based information model with type definitions

Primary Use Case

Defining interoperable digital twins for Azure Digital Twins and related cloud services

Semantic interoperability for Industry 4.0 and smart manufacturing assets

Semantic data modeling for industrial automation and device communication

Native Support for Relationships & Graphs

Built-in Telemetry & Command Interfaces

Open Source Specification

Cloud-Native First Design

Tight Integration with IoT Protocols (e.g., MQTT)

Formal Ontology / Semantic Layer

Uses JSON-LD for semantic definitions

Uses ECLASS or custom ontologies within submodels

Built-in semantic referencing via OPC UA companion specifications

Typical Deployment Context

Azure cloud platform and associated services

On-premise or hybrid industrial environments

Factory floor, SCADA systems, edge-to-cloud communication

DIGITAL TWIN DEFINITION LANGUAGE

Frequently Asked Questions

The Digital Twin Definition Language (DTDL) is the open modeling language for defining the semantics of digital twins. These questions address its core purpose, structure, and role in enterprise systems.

The Digital Twin Definition Language (DTDL) is an open modeling language, developed by Microsoft and published under the MIT license, used to define the semantic model of a digital twin, including its properties, telemetry, commands, relationships, and components in a JSON-LD format. It provides a vendor-neutral, machine-readable schema that ensures different systems can create and interpret digital twin data with shared meaning, which is the foundation of semantic interoperability. DTDL is a core component of Microsoft's Azure Digital Twins platform but is designed as an open standard to enable broader ecosystem development.

Prasad Kumkar

About the author

Prasad Kumkar

CEO & MD, Inference Systems

Prasad Kumkar is the CEO & MD of Inference Systems and writes about AI systems architecture, LLM infrastructure, model serving, evaluation, and production deployment. Over 5+ years, he has worked across computer vision models, L5 autonomous vehicle systems, and LLM research, with a focus on taking complex AI ideas into real-world engineering systems.

His work and writing cover AI systems, large language models, AI agents, multimodal systems, autonomous systems, inference optimization, RAG, evaluation, and production AI engineering.