Inferensys

Integration

AI Integration with Ignition for Cloud Deployment

Architectures for deploying scalable AI models alongside Ignition in cloud or hybrid environments, focusing on secure data exchange, model versioning, and high-value manufacturing inference workflows.
ML engineer managing model versions on laptop, version history visible, technical Git-like workflow.
SCALABLE INFERENCE, SECURE DATA EXCHANGE

Where AI Fits in a Cloud-Enabled Ignition Architecture

A practical blueprint for deploying AI models alongside Ignition in cloud or hybrid environments to augment real-time control with intelligent decision-making.

In a cloud-enabled Ignition architecture, AI models typically operate in a tiered inference layer, separate from the core real-time control logic. The Ignition gateway, whether on-premises or at the edge, serves as the high-fidelity data fabric, streaming time-series data from PLCs, sensors, and SQL databases to a cloud-based inference service via secure MQTT or REST APIs. This separation allows the SCADA/MES layer to maintain its deterministic performance while offloading compute-intensive AI tasks—like multivariate anomaly detection, predictive quality scoring, or image-based inspection analysis—to scalable cloud resources. Key integration surfaces are Ignition's Tag Historian for feature extraction, its Transaction Groups for batched data payloads to the cloud, and its Scripting and Gateway Events to receive inference results and trigger automated actions.

Production rollout requires careful model versioning and feedback loops. A cloud-based model registry manages versions, allowing you to A/B test a new predictive maintenance model against the current one without disrupting the Ignition project. Inference results are written back to Ignition as new tags (e.g., Equipment_A/Health_Score), which can drive HMI alerts, Perspective dashboards, or automated logic in Ignition's Expression and Scripting modules. For governance, all data exchanges should be logged via Ignition's Audit Profile system, and model inferences that drive control actions (like setpoint adjustments) should pass through a human-in-the-loop approval step configured in Ignition's Alarm Notification pipeline or a custom workflow before execution.

This architecture matters because it lets manufacturers incrementally add intelligence without a forklift upgrade. You can start with a single use case—like using AI to classify alarm floods—deployed as a containerized service in Azure/AWS/GCP, consuming Ignition data. As value is proven, you can scale to more complex workflows, such as dynamic recipe optimization, where the cloud AI suggests parameter adjustments that are validated by Ignition's logic and then pushed to PLCs. The key is maintaining secure, low-latency data exchange between edge and cloud, which Ignition facilitates through its native IIoT connectors and tunneling capabilities, ensuring shop floor data never leaves a protected pipeline while enabling cloud-scale analytics.

ARCHITECTURE PATTERNS

Ignition Touchpoints for Cloud AI Integration

Real-Time Data Pipeline for AI Inference

Ignition's core strength is its ability to act as a unified data fabric, bridging PLCs, SQL databases, and IIoT sensors. For cloud AI, this becomes the primary ingestion layer.

Key Touchpoints:

  • Tag Historian: Stream high-frequency time-series data (vibration, temperature, pressure) to cloud object storage for model training and batch inference.
  • SQL Bridge: Use Ignition's SQL Bridge to query transactional databases (Oracle, SQL Server) for batch records, work orders, and quality results, syncing this structured data to a cloud data warehouse.
  • MQTT/Sparkplug B: Leverage Ignition's native MQTT Transmission modules to publish contextualized machine events and alarm states to a cloud-based message broker (e.g., AWS IoT Core, Azure Event Hubs) for real-time, low-latency AI inference.

This architecture creates a governed pipeline where raw plant data is contextualized, cleansed, and routed to the appropriate cloud AI service—be it for real-time anomaly detection or periodic batch analysis.

SCALABLE INFERENCE PATTERNS

High-Value Use Cases for Cloud AI with Ignition

Deploying AI models alongside Ignition in cloud or hybrid environments enables scalable, secure intelligence that augments real-time control and operational decision-making. These patterns focus on leveraging Ignition's data fabric to feed AI and return actionable inferences to the shop floor.

01

Predictive Maintenance at the Edge-Cloud Boundary

Deploy lightweight anomaly detection models on Ignition Edge for low-latency alerts, while running heavier prognostic models in the cloud for remaining useful life (RUL) prediction. Ignition's gateway syncs compressed features to the cloud for model retraining and aggregates fleet-wide failure patterns.

Batch -> Real-time
Alerting cadence
02

Cloud-Based Quality Scoring for Real-Time Lines

Stream high-frequency sensor data (vision, pressure, temperature) from Ignition's SCADA layer to a cloud inference endpoint. Return a composite quality score and predicted defect classification in sub-second latency to trigger automatic divert or rework commands within Ignition's control scripts.

Same shift
Model feedback loop
03

Centralized Model Versioning & A/B Testing

Use the cloud as a control plane to manage multiple model versions (e.g., for yield optimization). Ignition clients call a cloud-based model router that directs inference requests to canary or production endpoints, with results logged back for performance comparison and automated rollback.

1 sprint
Experiment cycle
04

Secure Data Exchange for Multi-Plant Intelligence

Aggregate anonymized or aggregated time-series data from multiple Ignition gateways across plants into a cloud data lake. Train global models for energy optimization or OEE benchmarking, then deploy distilled insights or lightweight models back to each site's local Ignition instance for execution.

Hours -> Minutes
Cross-site analysis
05

AI-Enhanced HMIs with Cloud-Powered Copilots

Build Ignition Perspective screens that embed a cloud-hosted conversational agent. Operators use natural language to ask complex questions (e.g., "Why did OEE drop last hour?"). The agent queries both local Ignition historian data and cloud-based business context to generate a narrative summary and suggested actions.

Batch -> Real-time
Insight delivery
06

Hybrid Forecasting for Production Scheduling

Combine local Ignition data (machine states, order progress) with cloud-sourced data (supplier lead times, weather) in a cloud inference pipeline. Generate a dynamic finite schedule and push optimized setpoints and sequences back to Ignition's MES modules for execution, adjusting in near real-time.

Same day
Reschedule response
CLOUD-DEPLOYMENT PATTERNS

Example AI-Ignition Cloud Workflows

These workflows illustrate how AI models deployed in cloud environments (AWS, Azure, GCP) integrate with Ignition's edge and gateway architecture to deliver scalable intelligence without disrupting real-time control.

Trigger: Ignition Edge agent streams time-series vibration, temperature, and amperage data from PLCs to a cloud-based data lake via MQTT or Ignition's built-in cloud connectors.

Context/Data Pulled: The cloud pipeline aggregates 30 days of historical data for the asset, along with recent maintenance records from a connected CMMS (e.g., SAP PM, Maximo).

Model or Agent Action: A trained anomaly detection model (e.g., Isolation Forest, LSTM autoencoder) runs inference on the new data batch. If an anomaly score exceeds threshold, a diagnostic agent is triggered to analyze feature contributions and cross-reference with a failure mode knowledge base.

System Update or Next Step: The AI service posts a structured alert payload back to a designated Ignition Gateway MQTT topic. Ignition's scripting environment consumes this payload, updates a real-time asset health dashboard in Perspective, and conditionally creates a low-priority work order in the CMMS via REST API.

Human Review Point: The work order is flagged for engineering review before scheduling. The alert includes the contributing sensor tags and a confidence score, allowing engineers to accept, reject, or request more data.

CLOUD-DEPLOYED INFERENCE FOR IGNITION

Implementation Architecture: Data Flow & System Components

A practical blueprint for deploying scalable AI models alongside Ignition in cloud or hybrid environments, focusing on secure, low-latency data exchange between edge and cloud.

The core architecture establishes Ignition as the real-time data fabric at the edge or in a private cloud, while AI inference runs in a scalable public cloud environment like AWS, Azure, or GCP. Ignition's Tag Historian, SQL Bridge, and Perspective modules serve as the primary interfaces. Time-series data from PLCs and sensors is aggregated into the Ignition historian. For inference, relevant tag data and batch context are packaged into JSON payloads and published via Ignition's Message Queuing or REST Client scripting to a secure cloud endpoint. This decouples the control-critical SCADA layer from the computationally intensive AI model serving, ensuring shop floor stability.

In the cloud, an API Gateway (e.g., AWS API Gateway, Azure API Management) receives the payloads, handles authentication, and routes requests to a containerized Model Serving layer (e.g., using KServe, Seldon Core, or a cloud-native service like Azure Machine Learning endpoints). This layer hosts versioned models for tasks like predictive maintenance scoring or anomaly detection. Inference results are returned to Ignition via a webhook or written to a cloud SQL database or message queue (e.g., Azure Service Bus, AWS SQS). Ignition then consumes these results, using them to update HMI displays in Perspective, trigger alarms, or execute control scripts. A Vector Database (e.g., Pinecone, Weaviate) can be deployed in the cloud to support RAG workflows, where Ignition's SQL Bridge queries historical work orders or maintenance logs to provide context to LLM-based operator copilots.

Governance and rollout require managing model versioning and A/B testing through the cloud serving layer, with all inference calls logged for audit and drift detection. Data in transit must be encrypted, and access controlled via service principals or API keys. A hybrid deployment might run lightweight, latency-critical models on Ignition Edge nodes, while complex models run in the cloud. This architecture allows manufacturers to scale AI inference elastically, maintain Ignition's real-time performance, and implement a secure feedback loop where model predictions can be validated against actual outcomes recorded back in Ignition's historian.

CLOUD DEPLOYMENT PATTERNS

Code & Configuration Examples

Deploying a Model as a Cloud Service

Deploy your trained model (e.g., for predictive maintenance or quality scoring) as a scalable REST API endpoint. This pattern keeps the heavy inference load off the Ignition gateway, using the cloud for elastic scaling and model version management. Ignition acts as the client, sending batches of real-time features for scoring.

python
# Example: Python FastAPI endpoint for model inference
from fastapi import FastAPI, HTTPException
from pydantic import BaseModel
import joblib
import numpy as np

app = FastAPI()
model = joblib.load("quality_predictor_v2.pkl")

class InferenceRequest(BaseModel):
    features: list[list[float]]  # Batch of sensor/process features
    batch_id: str

def validate_features(features):
    # Add data validation/cleaning logic
    return np.array(features)

@app.post("/predict")
async def predict(request: InferenceRequest):
    try:
        X = validate_features(request.features)
        predictions = model.predict(X).tolist()
        return {"batch_id": request.batch_id, "predictions": predictions}
    except Exception as e:
        raise HTTPException(status_code=400, detail=str(e))

Ignition uses a Python Scripting data source or an HTTP Client transformer to POST feature data to this endpoint and process the JSON response.

CLOUD-DEPLOYED AI INFERENCE

Realistic Operational Impact & Time Savings

This table illustrates the operational improvements and time savings achieved by deploying AI models in a cloud-native architecture alongside Ignition, enabling scalable inference, centralized model management, and secure data exchange between edge and cloud.

MetricBefore AIAfter AINotes

Model Deployment & Versioning

Manual, script-based pushes to edge gateways

Centralized container registry with CI/CD pipelines

Rollback and A/B testing for models across plants in minutes

Inference Scalability for Peak Loads

Fixed capacity on-premises servers

Auto-scaling cloud endpoints with GPU burst

Handles seasonal data spikes or plant-wide analytics without hardware procurement

Edge-to-Cloud Data Sync

Scheduled batch exports, manual reconciliation

Real-time, secure streaming with configurable filters

Cloud models train on near-live data; edge gets updated models without full data transfer

Security & Access Governance

Firewall rules and manual certificate management

Unified IAM, API keys, and audit trails for all model calls

Role-based access to different AI capabilities (e.g., quality vs. maintenance models)

Cross-Plant Model Consistency

Different model versions per site, manual drift checks

Single cloud-hosted model serving all sites, centralized monitoring

Ensures uniform predictions and simplifies compliance reporting

Developer & Data Scientist Workflow

Local Jupyter notebooks, no shared environment

Cloud-based notebooks with direct access to Ignition historian data

Faster iteration from prototype to production deployment on shared platform

Disaster Recovery for AI Services

Manual restore from backups, potential days of downtime

Multi-region cloud deployment with automated failover

AI inference remains available even if a primary data center fails

ARCHITECTING FOR PRODUCTION

Governance, Security & Phased Rollout

A secure, governed approach to deploying AI models alongside Ignition in cloud or hybrid environments.

A production AI integration with Ignition requires a clear separation of concerns between the real-time control layer and the AI inference layer. The typical architecture involves Ignition's Gateway and Perspective modules handling data acquisition, HMI logic, and operator interactions, while a separate cloud or on-premises inference service (hosted on Kubernetes, Azure ML, or AWS SageMaker) processes batched or streaming data via secure APIs. This separation ensures that critical control logic remains deterministic and within the OT environment, while AI models can be versioned, scaled, and updated independently in the IT/cloud domain. Data exchange is managed through Ignition's Tag Historian, Database Connections, or REST Client modules, sending context-rich payloads (e.g., equipment IDs, timestamps, sensor arrays) to the inference endpoint and receiving structured predictions (anomaly scores, setpoint recommendations, failure probabilities) back for visualization and action.

Security is enforced at multiple levels: network segmentation with firewalls between the plant floor and cloud, mutual TLS for API calls, and strict role-based access control (RBAC) within Ignition to govern which operators or engineers can view AI insights or trigger AI-driven actions. All inference requests and model outputs should be logged to an immutable audit trail, linking predictions to the specific production batch, machine, and operator session. For sensitive data, implement data anonymization or pseudonymization at the Ignition gateway before transmission, and ensure any cloud AI service complies with your data residency and sovereignty requirements. Use Ignition's scripting and alarm pipeline to implement human-in-the-loop approvals for any AI-recommended control changes, creating a mandatory review step before setpoints are written back to PLCs.

A phased rollout minimizes risk and builds operational trust. Start with a read-only Phase 1: deploy AI models for predictive analytics and alerting only, displaying insights in a dedicated Perspective view or alongside existing HMIs. Focus on non-critical equipment or a single production line. In Phase 2, introduce closed-loop recommendations, where the AI suggests actions (e.g., "adjust temperature setpoint by -2°C") that require operator acknowledgment before execution. Finally, Phase 3 enables fully automated, low-risk adjustments for specific, well-understood scenarios, with continuous monitoring of model performance and drift. Establish a model governance workflow using tools like MLflow or Weights & Biases, integrated with your CI/CD pipeline, to manage the promotion of new model versions from development to staging to production, with the ability to rollback to a previous version via Ignition's configuration management if inference quality degrades.

AI INTEGRATION WITH IGNITION

Frequently Asked Questions

Common technical and architectural questions for deploying AI models alongside Ignition in cloud or hybrid environments for scalable, secure manufacturing intelligence.

A secure data exchange is critical. The recommended pattern uses Ignition's MQTT Transmission module or a dedicated gateway service.

  1. Data Selection & Obfuscation: At the edge, filter and anonymize sensitive data (e.g., PII, proprietary formulas) before transmission. Use Ignition's scripting to tag data streams.
  2. Secure Transport: Establish an outbound, TLS-encrypted MQTT connection from the Ignition gateway to a cloud message broker (e.g., AWS IoT Core, Azure IoT Hub, HiveMQ Cloud). Use certificate-based authentication, never hard-coded keys.
  3. Cloud Ingestion & Inference: The broker routes payloads to a cloud queue (e.g., AWS SQS, Azure Service Bus). A serverless function (AWS Lambda, Azure Function) triggers, calls the AI model endpoint (hosted on SageMaker, Azure ML, or a container service), and returns the inference.
  4. Result Routing: Results are published back to a dedicated MQTT topic. The on-prem Ignition instance subscribes to this topic to receive actionable insights (e.g., a predictive alert, an optimized setpoint).

This pattern keeps the firewall simple (outbound-only), maintains data governance, and enables low-latency feedback loops.

Prasad Kumkar

About the author

Prasad Kumkar

CEO & MD, Inference Systems

Prasad Kumkar is the CEO & MD of Inference Systems and writes about AI systems architecture, LLM infrastructure, model serving, evaluation, and production deployment. Over 5+ years, he has worked across computer vision models, L5 autonomous vehicle systems, and LLM research, with a focus on taking complex AI ideas into real-world engineering systems.

His work and writing cover AI systems, large language models, AI agents, multimodal systems, autonomous systems, inference optimization, RAG, evaluation, and production AI engineering.