In a cloud-enabled Ignition architecture, AI models typically operate in a tiered inference layer, separate from the core real-time control logic. The Ignition gateway, whether on-premises or at the edge, serves as the high-fidelity data fabric, streaming time-series data from PLCs, sensors, and SQL databases to a cloud-based inference service via secure MQTT or REST APIs. This separation allows the SCADA/MES layer to maintain its deterministic performance while offloading compute-intensive AI tasks—like multivariate anomaly detection, predictive quality scoring, or image-based inspection analysis—to scalable cloud resources. Key integration surfaces are Ignition's Tag Historian for feature extraction, its Transaction Groups for batched data payloads to the cloud, and its Scripting and Gateway Events to receive inference results and trigger automated actions.
Integration
AI Integration with Ignition for Cloud Deployment

Where AI Fits in a Cloud-Enabled Ignition Architecture
A practical blueprint for deploying AI models alongside Ignition in cloud or hybrid environments to augment real-time control with intelligent decision-making.
Production rollout requires careful model versioning and feedback loops. A cloud-based model registry manages versions, allowing you to A/B test a new predictive maintenance model against the current one without disrupting the Ignition project. Inference results are written back to Ignition as new tags (e.g., Equipment_A/Health_Score), which can drive HMI alerts, Perspective dashboards, or automated logic in Ignition's Expression and Scripting modules. For governance, all data exchanges should be logged via Ignition's Audit Profile system, and model inferences that drive control actions (like setpoint adjustments) should pass through a human-in-the-loop approval step configured in Ignition's Alarm Notification pipeline or a custom workflow before execution.
This architecture matters because it lets manufacturers incrementally add intelligence without a forklift upgrade. You can start with a single use case—like using AI to classify alarm floods—deployed as a containerized service in Azure/AWS/GCP, consuming Ignition data. As value is proven, you can scale to more complex workflows, such as dynamic recipe optimization, where the cloud AI suggests parameter adjustments that are validated by Ignition's logic and then pushed to PLCs. The key is maintaining secure, low-latency data exchange between edge and cloud, which Ignition facilitates through its native IIoT connectors and tunneling capabilities, ensuring shop floor data never leaves a protected pipeline while enabling cloud-scale analytics.
Ignition Touchpoints for Cloud AI Integration
Real-Time Data Pipeline for AI Inference
Ignition's core strength is its ability to act as a unified data fabric, bridging PLCs, SQL databases, and IIoT sensors. For cloud AI, this becomes the primary ingestion layer.
Key Touchpoints:
- Tag Historian: Stream high-frequency time-series data (vibration, temperature, pressure) to cloud object storage for model training and batch inference.
- SQL Bridge: Use Ignition's SQL Bridge to query transactional databases (Oracle, SQL Server) for batch records, work orders, and quality results, syncing this structured data to a cloud data warehouse.
- MQTT/Sparkplug B: Leverage Ignition's native MQTT Transmission modules to publish contextualized machine events and alarm states to a cloud-based message broker (e.g., AWS IoT Core, Azure Event Hubs) for real-time, low-latency AI inference.
This architecture creates a governed pipeline where raw plant data is contextualized, cleansed, and routed to the appropriate cloud AI service—be it for real-time anomaly detection or periodic batch analysis.
High-Value Use Cases for Cloud AI with Ignition
Deploying AI models alongside Ignition in cloud or hybrid environments enables scalable, secure intelligence that augments real-time control and operational decision-making. These patterns focus on leveraging Ignition's data fabric to feed AI and return actionable inferences to the shop floor.
Predictive Maintenance at the Edge-Cloud Boundary
Deploy lightweight anomaly detection models on Ignition Edge for low-latency alerts, while running heavier prognostic models in the cloud for remaining useful life (RUL) prediction. Ignition's gateway syncs compressed features to the cloud for model retraining and aggregates fleet-wide failure patterns.
Cloud-Based Quality Scoring for Real-Time Lines
Stream high-frequency sensor data (vision, pressure, temperature) from Ignition's SCADA layer to a cloud inference endpoint. Return a composite quality score and predicted defect classification in sub-second latency to trigger automatic divert or rework commands within Ignition's control scripts.
Centralized Model Versioning & A/B Testing
Use the cloud as a control plane to manage multiple model versions (e.g., for yield optimization). Ignition clients call a cloud-based model router that directs inference requests to canary or production endpoints, with results logged back for performance comparison and automated rollback.
Secure Data Exchange for Multi-Plant Intelligence
Aggregate anonymized or aggregated time-series data from multiple Ignition gateways across plants into a cloud data lake. Train global models for energy optimization or OEE benchmarking, then deploy distilled insights or lightweight models back to each site's local Ignition instance for execution.
AI-Enhanced HMIs with Cloud-Powered Copilots
Build Ignition Perspective screens that embed a cloud-hosted conversational agent. Operators use natural language to ask complex questions (e.g., "Why did OEE drop last hour?"). The agent queries both local Ignition historian data and cloud-based business context to generate a narrative summary and suggested actions.
Hybrid Forecasting for Production Scheduling
Combine local Ignition data (machine states, order progress) with cloud-sourced data (supplier lead times, weather) in a cloud inference pipeline. Generate a dynamic finite schedule and push optimized setpoints and sequences back to Ignition's MES modules for execution, adjusting in near real-time.
Example AI-Ignition Cloud Workflows
These workflows illustrate how AI models deployed in cloud environments (AWS, Azure, GCP) integrate with Ignition's edge and gateway architecture to deliver scalable intelligence without disrupting real-time control.
Trigger: Ignition Edge agent streams time-series vibration, temperature, and amperage data from PLCs to a cloud-based data lake via MQTT or Ignition's built-in cloud connectors.
Context/Data Pulled: The cloud pipeline aggregates 30 days of historical data for the asset, along with recent maintenance records from a connected CMMS (e.g., SAP PM, Maximo).
Model or Agent Action: A trained anomaly detection model (e.g., Isolation Forest, LSTM autoencoder) runs inference on the new data batch. If an anomaly score exceeds threshold, a diagnostic agent is triggered to analyze feature contributions and cross-reference with a failure mode knowledge base.
System Update or Next Step: The AI service posts a structured alert payload back to a designated Ignition Gateway MQTT topic. Ignition's scripting environment consumes this payload, updates a real-time asset health dashboard in Perspective, and conditionally creates a low-priority work order in the CMMS via REST API.
Human Review Point: The work order is flagged for engineering review before scheduling. The alert includes the contributing sensor tags and a confidence score, allowing engineers to accept, reject, or request more data.
Implementation Architecture: Data Flow & System Components
A practical blueprint for deploying scalable AI models alongside Ignition in cloud or hybrid environments, focusing on secure, low-latency data exchange between edge and cloud.
The core architecture establishes Ignition as the real-time data fabric at the edge or in a private cloud, while AI inference runs in a scalable public cloud environment like AWS, Azure, or GCP. Ignition's Tag Historian, SQL Bridge, and Perspective modules serve as the primary interfaces. Time-series data from PLCs and sensors is aggregated into the Ignition historian. For inference, relevant tag data and batch context are packaged into JSON payloads and published via Ignition's Message Queuing or REST Client scripting to a secure cloud endpoint. This decouples the control-critical SCADA layer from the computationally intensive AI model serving, ensuring shop floor stability.
In the cloud, an API Gateway (e.g., AWS API Gateway, Azure API Management) receives the payloads, handles authentication, and routes requests to a containerized Model Serving layer (e.g., using KServe, Seldon Core, or a cloud-native service like Azure Machine Learning endpoints). This layer hosts versioned models for tasks like predictive maintenance scoring or anomaly detection. Inference results are returned to Ignition via a webhook or written to a cloud SQL database or message queue (e.g., Azure Service Bus, AWS SQS). Ignition then consumes these results, using them to update HMI displays in Perspective, trigger alarms, or execute control scripts. A Vector Database (e.g., Pinecone, Weaviate) can be deployed in the cloud to support RAG workflows, where Ignition's SQL Bridge queries historical work orders or maintenance logs to provide context to LLM-based operator copilots.
Governance and rollout require managing model versioning and A/B testing through the cloud serving layer, with all inference calls logged for audit and drift detection. Data in transit must be encrypted, and access controlled via service principals or API keys. A hybrid deployment might run lightweight, latency-critical models on Ignition Edge nodes, while complex models run in the cloud. This architecture allows manufacturers to scale AI inference elastically, maintain Ignition's real-time performance, and implement a secure feedback loop where model predictions can be validated against actual outcomes recorded back in Ignition's historian.
Code & Configuration Examples
Deploying a Model as a Cloud Service
Deploy your trained model (e.g., for predictive maintenance or quality scoring) as a scalable REST API endpoint. This pattern keeps the heavy inference load off the Ignition gateway, using the cloud for elastic scaling and model version management. Ignition acts as the client, sending batches of real-time features for scoring.
python# Example: Python FastAPI endpoint for model inference from fastapi import FastAPI, HTTPException from pydantic import BaseModel import joblib import numpy as np app = FastAPI() model = joblib.load("quality_predictor_v2.pkl") class InferenceRequest(BaseModel): features: list[list[float]] # Batch of sensor/process features batch_id: str def validate_features(features): # Add data validation/cleaning logic return np.array(features) @app.post("/predict") async def predict(request: InferenceRequest): try: X = validate_features(request.features) predictions = model.predict(X).tolist() return {"batch_id": request.batch_id, "predictions": predictions} except Exception as e: raise HTTPException(status_code=400, detail=str(e))
Ignition uses a Python Scripting data source or an HTTP Client transformer to POST feature data to this endpoint and process the JSON response.
Realistic Operational Impact & Time Savings
This table illustrates the operational improvements and time savings achieved by deploying AI models in a cloud-native architecture alongside Ignition, enabling scalable inference, centralized model management, and secure data exchange between edge and cloud.
| Metric | Before AI | After AI | Notes |
|---|---|---|---|
Model Deployment & Versioning | Manual, script-based pushes to edge gateways | Centralized container registry with CI/CD pipelines | Rollback and A/B testing for models across plants in minutes |
Inference Scalability for Peak Loads | Fixed capacity on-premises servers | Auto-scaling cloud endpoints with GPU burst | Handles seasonal data spikes or plant-wide analytics without hardware procurement |
Edge-to-Cloud Data Sync | Scheduled batch exports, manual reconciliation | Real-time, secure streaming with configurable filters | Cloud models train on near-live data; edge gets updated models without full data transfer |
Security & Access Governance | Firewall rules and manual certificate management | Unified IAM, API keys, and audit trails for all model calls | Role-based access to different AI capabilities (e.g., quality vs. maintenance models) |
Cross-Plant Model Consistency | Different model versions per site, manual drift checks | Single cloud-hosted model serving all sites, centralized monitoring | Ensures uniform predictions and simplifies compliance reporting |
Developer & Data Scientist Workflow | Local Jupyter notebooks, no shared environment | Cloud-based notebooks with direct access to Ignition historian data | Faster iteration from prototype to production deployment on shared platform |
Disaster Recovery for AI Services | Manual restore from backups, potential days of downtime | Multi-region cloud deployment with automated failover | AI inference remains available even if a primary data center fails |
Governance, Security & Phased Rollout
A secure, governed approach to deploying AI models alongside Ignition in cloud or hybrid environments.
A production AI integration with Ignition requires a clear separation of concerns between the real-time control layer and the AI inference layer. The typical architecture involves Ignition's Gateway and Perspective modules handling data acquisition, HMI logic, and operator interactions, while a separate cloud or on-premises inference service (hosted on Kubernetes, Azure ML, or AWS SageMaker) processes batched or streaming data via secure APIs. This separation ensures that critical control logic remains deterministic and within the OT environment, while AI models can be versioned, scaled, and updated independently in the IT/cloud domain. Data exchange is managed through Ignition's Tag Historian, Database Connections, or REST Client modules, sending context-rich payloads (e.g., equipment IDs, timestamps, sensor arrays) to the inference endpoint and receiving structured predictions (anomaly scores, setpoint recommendations, failure probabilities) back for visualization and action.
Security is enforced at multiple levels: network segmentation with firewalls between the plant floor and cloud, mutual TLS for API calls, and strict role-based access control (RBAC) within Ignition to govern which operators or engineers can view AI insights or trigger AI-driven actions. All inference requests and model outputs should be logged to an immutable audit trail, linking predictions to the specific production batch, machine, and operator session. For sensitive data, implement data anonymization or pseudonymization at the Ignition gateway before transmission, and ensure any cloud AI service complies with your data residency and sovereignty requirements. Use Ignition's scripting and alarm pipeline to implement human-in-the-loop approvals for any AI-recommended control changes, creating a mandatory review step before setpoints are written back to PLCs.
A phased rollout minimizes risk and builds operational trust. Start with a read-only Phase 1: deploy AI models for predictive analytics and alerting only, displaying insights in a dedicated Perspective view or alongside existing HMIs. Focus on non-critical equipment or a single production line. In Phase 2, introduce closed-loop recommendations, where the AI suggests actions (e.g., "adjust temperature setpoint by -2°C") that require operator acknowledgment before execution. Finally, Phase 3 enables fully automated, low-risk adjustments for specific, well-understood scenarios, with continuous monitoring of model performance and drift. Establish a model governance workflow using tools like MLflow or Weights & Biases, integrated with your CI/CD pipeline, to manage the promotion of new model versions from development to staging to production, with the ability to rollback to a previous version via Ignition's configuration management if inference quality degrades.
Enabling Efficiency, Speed & Accuracy
Intelligent Analysis, Decision & Execution
We build AI systems for teams that need search across company data, workflow automation across tools, or AI features inside products and internal software.
Talk to Us
Search across company data
Give teams answers from docs, tickets, runbooks, and product data with sources and permissions.
Useful when people spend too long searching or get different answers from different systems.

Automate internal workflows
Use AI to route work, draft outputs, trigger actions, and keep approvals and logs in place.
Useful when repetitive work moves across multiple tools and teams.

Add AI to products and internal tools
Build assistants, guided actions, or decision support into the software your team or customers already use.
Useful when AI needs to be part of the product, not a separate tool.
Frequently Asked Questions
Common technical and architectural questions for deploying AI models alongside Ignition in cloud or hybrid environments for scalable, secure manufacturing intelligence.
A secure data exchange is critical. The recommended pattern uses Ignition's MQTT Transmission module or a dedicated gateway service.
- Data Selection & Obfuscation: At the edge, filter and anonymize sensitive data (e.g., PII, proprietary formulas) before transmission. Use Ignition's scripting to tag data streams.
- Secure Transport: Establish an outbound, TLS-encrypted MQTT connection from the Ignition gateway to a cloud message broker (e.g., AWS IoT Core, Azure IoT Hub, HiveMQ Cloud). Use certificate-based authentication, never hard-coded keys.
- Cloud Ingestion & Inference: The broker routes payloads to a cloud queue (e.g., AWS SQS, Azure Service Bus). A serverless function (AWS Lambda, Azure Function) triggers, calls the AI model endpoint (hosted on SageMaker, Azure ML, or a container service), and returns the inference.
- Result Routing: Results are published back to a dedicated MQTT topic. The on-prem Ignition instance subscribes to this topic to receive actionable insights (e.g., a predictive alert, an optimized setpoint).
This pattern keeps the firewall simple (outbound-only), maintains data governance, and enables low-latency feedback loops.

About the author
Prasad Kumkar
CEO & MD, Inference Systems
Prasad Kumkar is the CEO & MD of Inference Systems and writes about AI systems architecture, LLM infrastructure, model serving, evaluation, and production deployment. Over 5+ years, he has worked across computer vision models, L5 autonomous vehicle systems, and LLM research, with a focus on taking complex AI ideas into real-world engineering systems.
His work and writing cover AI systems, large language models, AI agents, multimodal systems, autonomous systems, inference optimization, RAG, evaluation, and production AI engineering.
Partnered with leading AI, data, and software stack.
How We Work
Custom AI workflows for your Business
One-fit-all AI don't work for modern businesses. At Inferensys, we aim to understand your business & custom requirements; which we use to define most efficient agentic workflows, the data, and the tools for your business.
01
Review the use case
We understand the task, the users, and where AI can actually help.
Read more02
Pick the right approach
We define what needs search, automation, or product integration.
Read more03
Build the first useful version
We implement the part that proves the value first.
Read more04
Improve from there
We add the checks and visibility needed to keep it useful.
Read moreThe first call is a practical review of your use case and the right next step.
Talk to Us