AI workloads fit into the OpenShift Serverless architecture at three key surfaces: the Knative Serving layer for model endpoints, the Knative Eventing system for workflow triggers, and the OpenShift Serverless Functions framework for developer productivity. The primary integration points are:
- Knative Services as AI Endpoints: Deploy fine-tuned LLMs, embedding models, or multi-modal models as serverless services. These automatically scale from zero based on request concurrency, optimizing for sporadic inference traffic and eliminating idle GPU costs.
- Event Sources and Brokers for AI Triggers: Configure Knative Eventing sources (like Kafka, CloudEvents, or CronJobs) to invoke AI services. For example, a new document uploaded to an S3-compatible bucket can trigger a summarization service, or a scheduled event can initiate a batch data enrichment workflow.
- Functions for Lightweight AI Agents: Use the
funcCLI and OpenShift Serverless Functions to build and deploy lightweight Python or Node.js functions that wrap AI SDK calls (e.g., OpenAI, Anthropic, Hugging Face) for tasks like sentiment analysis, classification, or data formatting, abstracting the underlying infrastructure.




