Glossary

Infrastructure as Code (IaC)

Infrastructure as Code (IaC) is the practice of managing and provisioning computing infrastructure through machine-readable definition files, rather than physical hardware configuration or interactive configuration tools.

Get in touch Learn more

Developer demonstrating multi-agent tool use, agent tool selection interface on laptop, casual tech demo moment.

TRAFFIC AND DEPLOYMENT STRATEGIES

What is Infrastructure as Code (IaC)?

Infrastructure as Code (IaC) is a foundational DevOps practice for managing and provisioning computing infrastructure through machine-readable definition files, rather than manual processes.

Infrastructure as Code (IaC) is the practice of managing and provisioning computing infrastructure—including networks, virtual machines, load balancers, and connection topology—through machine-readable definition files, rather than physical hardware configuration or interactive configuration tools. This treats servers, storage, and networking as versionable, testable, and repeatable software artifacts. Core tools like Terraform, AWS CloudFormation, and Pulumi enable teams to define their desired infrastructure state declaratively, which an automation engine then provisions and enforces.

For LLM deployment, IaC is critical for creating reproducible, scalable environments for model serving, vector databases, and monitoring stacks. It enables GitOps workflows where infrastructure changes are tracked via pull requests, and automated pipelines apply them. This ensures that canary deployments, auto-scaling policies for inference endpoints, and multi-region failover configurations are consistent, auditable, and free from configuration drift, directly supporting progressive delivery and high availability strategies.

FOUNDATIONAL CONCEPTS

Core Principles of Infrastructure as Code

Declarative vs. Imperative

IaC tools operate on two primary paradigms. Declarative (or functional) IaC defines the desired end state of the infrastructure (e.g., 'ensure 5 web servers exist'), and the tool's engine determines the sequence of operations to achieve it. Examples include Terraform, AWS CloudFormation, and Pulumi (in declarative mode). Imperative IaC specifies the exact sequence of commands to execute to reach the state (e.g., 'run this API call, then that one'). Tools like Ansible (playbooks) and shell scripts often use this approach. Declarative IaC is generally preferred for its idempotency and focus on outcome over process.

Idempotency

A fundamental property where applying the same IaC configuration multiple times results in the same infrastructure state, regardless of the starting point. This ensures safety and predictability. If a configuration declares 3 instances, running it once creates 3 instances; running it a second time does nothing, as the desired state is already met. This prevents configuration drift and allows for safe, repeated execution as part of Continuous Integration/Continuous Deployment (CI/CD) pipelines. Non-idempotent scripts can create duplicate resources or cause errors on re-runs.

Version Control & Collaboration

IaC definition files are treated as source code, stored and versioned in systems like Git. This enables:

Full Audit Trail: Every change to infrastructure is tracked with a commit history, showing who changed what and why.
Code Review: Infrastructure changes undergo peer review via pull requests, improving quality and knowledge sharing.
Branching & Merging: Teams can work on infrastructure changes in isolation (e.g., feature branches) and merge them systematically.
Rollback Capability: Reverting to a previous, known-good infrastructure state is as simple as reverting a Git commit and re-applying the configuration.

Immutable Infrastructure

The practice of replacing entire infrastructure components (e.g., servers, containers) with new, versioned instances rather than modifying existing ones in-place. Instead of patching or updating a live server, a new server image (AMI, Docker container) is built from the IaC definitions, deployed, and the old one is terminated. This eliminates configuration drift, ensures consistency between environments (dev, staging, prod), and simplifies rollback (deploy the previous image). It is a core pattern enabled by IaC and is central to modern cloud and container-based deployments.

Automation & CI/CD Integration

IaC enables the full automation of infrastructure provisioning and management. Code changes trigger automated pipelines that:

Validate syntax and configuration.
Plan/Preview changes in a sandbox (e.g., terraform plan).
Apply changes to environments automatically or with approval gates. This integration is the foundation of GitOps, where the Git repository state is the single source of truth, and automated operators continuously reconcile the live infrastructure to match. It reduces manual error, accelerates deployment frequency, and enforces consistent governance.

Modularity & Reusability

IaC promotes the creation of reusable, parameterized modules or templates that abstract complex infrastructure patterns. For example, a 'web cluster' module could encapsulate an auto-scaling group, load balancer, and security groups. This module can then be reused across multiple projects or environments (dev, prod) with different input variables (instance size, min/max nodes). This DRY (Don't Repeat Yourself) principle reduces code duplication, standardizes architecture, and makes large-scale infrastructure manageable. Public and private registries (like the Terraform Registry) facilitate sharing these modules across teams and organizations.

TRAFFIC AND DEPLOYMENT STRATEGIES

How Infrastructure as Code Works

Infrastructure as Code (IaC) is the foundational engineering practice for managing modern, scalable LLM deployments. It automates the provisioning of the compute, networking, and storage resources required for model serving, traffic routing, and high-availability rollouts.

Infrastructure as Code (IaC) is the practice of managing and provisioning computing infrastructure through machine-readable definition files, rather than manual hardware configuration. For LLM operations, this means defining model-serving clusters, load balancers, auto-scaling policies, and network security as declarative code (e.g., in Terraform or Pulumi). This code is version-controlled, enabling reproducible, auditable, and consistent environments from development to production, which is critical for canary deployments and multi-region deployment strategies.

The core mechanism is a declarative or imperative model where a desired infrastructure state is defined. An IaC tool (like Terraform, AWS CloudFormation, or Crossplane) then orchestrates cloud provider APIs to create, update, or destroy resources to match that state. This automates the entire lifecycle, enabling GitOps workflows where infrastructure changes are peer-reviewed and automatically applied. For LLM serving, this ensures that traffic splitting, service mesh configurations, and horizontal pod autoscaler rules are deployed identically every time, eliminating configuration drift and enabling rapid, safe rollbacks.

COMPARISON

IaC vs. Traditional Infrastructure Management

A side-by-side comparison of Infrastructure as Code (IaC) and traditional, manual infrastructure management across key operational dimensions.

Feature / Dimension	Infrastructure as Code (IaC)	Traditional Infrastructure Management
Core Methodology	Declarative or imperative definition files (e.g., Terraform, CloudFormation, Pulumi)	Manual configuration via CLI, GUI, or ad-hoc scripts
Provisioning Speed	Minutes to hours for full environment creation	Days to weeks for procurement, setup, and configuration
Change Management	Version-controlled code reviews, automated drift detection, and reconciliation	Manual change tickets, runbooks, and inconsistent documentation
Consistency & Idempotency	True. Environments are reproducible and identical across deployments.	False. Configuration drift and 'snowflake servers' are common.
Disaster Recovery	Infrastructure can be recreated from source code in < 1 hour	Recovery relies on backups and manual rebuilds, often taking days
Collaboration & Audit Trail	Git-based workflows provide full history, authorship, and peer review	Relies on ticket systems and individual knowledge; audit trails are fragmented
Cost Visibility & Optimization	Resource tagging and cost estimation are integral; unused resources are easily identified and terminated	Cost tracking is retrospective and manual; orphaned resources frequently lead to waste
Integration with CI/CD	True. Infrastructure changes are tested and deployed as part of the application pipeline.	False. Infrastructure is a separate, manual process decoupled from application delivery.

INFRASTRUCTURE AS CODE

Common IaC Tools and Platforms

Infrastructure as Code (IaC) is managed through declarative or imperative definition files. The ecosystem is dominated by a few major tools, each with distinct philosophies and target environments.

Terraform

Terraform by HashiCorp is the dominant declarative IaC tool, using its own Hashicorp Configuration Language (HCL). It manages resources by defining a desired end-state, and its core innovation is the state file, which tracks the real-world resources it manages. Terraform's strength is its vast provider ecosystem, enabling management of resources across AWS, Azure, GCP, and thousands of other services via plugins.

Key Concept: Desired State Configuration.
Primary Use: Multi-cloud provisioning and lifecycle management.
State Management: Requires secure storage (e.g., Terraform Cloud, S3 backend) for team collaboration.

EXPLORE

AWS CloudFormation

AWS CloudFormation is Amazon's native, declarative IaC service for managing AWS resources. Infrastructure is defined using JSON or YAML templates. It is tightly integrated with the AWS ecosystem, offering deep awareness of AWS services and their dependencies. Stacks are the fundamental unit of management, and changes are executed as change sets for preview.

Key Concept: Stack-based management with automatic dependency resolution.
Primary Use: Exclusive, deep management of AWS environments.
Rollback: Built-in automatic rollback on failure to a previous known state.

EXPLORE

Pulumi

Pulumi is a modern IaC platform that allows developers to define infrastructure using general-purpose programming languages like Python, TypeScript, Go, and C#. This approach, known as Infrastructure as Software, enables the use of loops, conditionals, and code reuse. It supports true imperative logic while still performing declarative resource planning. It maintains its own state and supports major clouds and Kubernetes.

Key Concept: Infrastructure defined using familiar programming languages.
Primary Use: Developer-centric IaC with strong abstraction and testing capabilities.
Difference: Contrasts with domain-specific languages (DSL) like HCL.

EXPLORE

Ansible

Ansible by Red Hat is primarily a configuration management and application deployment tool that can be used for IaC. It is agentless, using SSH or WinRM, and follows an imperative model, executing tasks defined in YAML playbooks. While it can provision cloud resources via modules, its strength is in post-provision configuration, making it a common companion to tools like Terraform.

Key Concept: Idempotent, agentless configuration management.
Primary Use: OS configuration, software installation, and service management.
Model: Procedural/imperative; describes a list of tasks to execute.

EXPLORE

Crossplane

Crossplane is a Kubernetes-native IaC framework that extends the Kubernetes API to manage both cloud services and Kubernetes clusters themselves. You define infrastructure using Kubernetes Custom Resources (CRDs) and YAML manifests. It treats cloud resources (like databases or buckets) as Managed Resources that can be composed into higher-level abstractions, applying Kubernetes patterns like controllers and operators to external infrastructure.

Key Concept: Kubernetes-style declarative management for anything.
Primary Use: Unified control plane for container and cloud infrastructure.
Architecture: Built on the Kubernetes controller pattern for continuous reconciliation.

EXPLORE

CDK for Terraform & Cloud Development Kit

The Cloud Development Kit (CDK) paradigm provides programming language abstractions for defining cloud infrastructure. AWS CDK generates CloudFormation templates. CDK for Terraform (CDKTF) allows the use of programming languages to generate Terraform HCL configuration and state. This bridges the gap between the flexibility of Pulumi and the ecosystem of Terraform or CloudFormation.

Key Concept: Synthesizes infrastructure code from programming languages into underlying IaC configs.
Primary Use: Developer productivity on top of established Terraform or CloudFormation ecosystems.
Output: CDKTF produces Terraform JSON, which can be used by the standard terraform CLI.

EXPLORE

INFRASTRUCTURE AS CODE (IAC)

Frequently Asked Questions

Infrastructure as Code (IaC) is a foundational DevOps practice for managing and provisioning computing infrastructure through machine-readable definition files. This FAQ addresses its core principles, tools, and role in modern LLM deployment and traffic management.

Infrastructure as Code (IaC) is the practice of managing and provisioning computing infrastructure—including servers, networks, databases, and container clusters—through machine-readable definition files, rather than manual hardware configuration or interactive tools. It works by using declarative or imperative code (written in languages like HCL, YAML, or Python) to describe the desired state of the infrastructure. This code is then executed by an IaC tool (like Terraform, AWS CloudFormation, or Pulumi), which calls cloud provider APIs to create, update, or destroy resources to match the defined state. For LLM deployments, this means defining GPU clusters, autoscaling groups for inference endpoints, and vector database instances as code, ensuring identical, repeatable environments for development, staging, and production.

Enabling Efficiency, Speed & Accuracy

Intelligent Analysis, Decision & Execution

We build AI systems for teams that need search across company data, workflow automation across tools, or AI features inside products and internal software.

Talk to Us

Search across company data

Give teams answers from docs, tickets, runbooks, and product data with sources and permissions.

Useful when people spend too long searching or get different answers from different systems.

Enterprise searchRAGPermissions

Automate internal workflows

Use AI to route work, draft outputs, trigger actions, and keep approvals and logs in place.

Useful when repetitive work moves across multiple tools and teams.

AI agentsWorkflow automationGovernance

Add AI to products and internal tools

Build assistants, guided actions, or decision support into the software your team or customers already use.

Useful when AI needs to be part of the product, not a separate tool.

AI integrationDecision supportModel routing

INFRASTRUCTURE AS CODE (IAC) ECOSYSTEM

Related Terms

Infrastructure as Code (IaC) is a foundational practice within modern DevOps and Platform Engineering. It enables the management of computing infrastructure through machine-readable definition files, ensuring consistency, repeatability, and version control. The following concepts are critical to implementing and scaling IaC effectively.

Declarative vs. Imperative IaC

IaC tools operate on two primary paradigms. Declarative IaC (e.g., Terraform, AWS CloudFormation) defines the desired end state of the infrastructure, and the tool's engine determines the sequence of API calls to achieve it. Imperative IaC (e.g., Ansible, Chef, Puppet) defines the specific commands or steps needed to configure the infrastructure. Declarative approaches are generally preferred for provisioning cloud resources due to their idempotency and focus on outcome, while imperative tools excel at configuration management on existing servers.

GitOps

GitOps is an operational framework that extends IaC principles. It uses Git repositories as the single source of truth for both application code and declarative infrastructure definitions. Automated operators (like Flux or ArgoCD) continuously monitor the Git repo and reconcile the state of the live system (e.g., a Kubernetes cluster) with the committed definitions. This creates a closed-loop system where all changes are versioned, auditable, and applied automatically, enforcing rigorous change management and rollback capabilities.

Configuration Drift

Configuration drift occurs when the actual, running state of infrastructure diverges from the state defined in the IaC source files. This is a major risk that IaC aims to prevent. Drift can be introduced by:

Manual, ad-hoc changes made directly in the cloud console.
External processes or scripts not managed by IaC.
Resource failures and manual recovery procedures. IaC tools like Terraform can detect drift by comparing its state file to the live cloud environment. Mitigation involves either re-applying the IaC to enforce the defined state or importing the drifted resource back under IaC management.