Agent Quality of Service (QoS) is a classification (Guaranteed, Burstable, or BestEffort) that an orchestrator such as Kubernetes assigns based on an agent's declared resource requests and limits. The declared requests feed scheduling decisions, and the resulting class influences the order in which agents are evicted when a node comes under resource pressure: typically BestEffort first, then Burstable, with Guaranteed last. This gives operators a mechanism for ensuring that high-priority agents keep the compute resources they need to run reliably.
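To make the classification rule concrete, here is a minimal sketch of how the three classes could be derived from requests and limits, following the rules Kubernetes documents for pods. The function name `qos_class` and the dictionary layout are illustrative assumptions, not part of any Kubernetes client library, and quantities are compared as plain strings rather than parsed (so "1" and "1000m" would count as different here).

```python
from typing import Dict, List

# Hypothetical container shape for illustration:
# {"requests": {"cpu": "500m", "memory": "256Mi"},
#  "limits":   {"cpu": "500m", "memory": "256Mi"}}
Container = Dict[str, Dict[str, str]]

def qos_class(containers: List[Container]) -> str:
    """Classify an agent as Guaranteed, Burstable, or BestEffort."""
    any_set = False      # True once any CPU/memory request or limit is seen
    guaranteed = True    # stays True only if every container qualifies

    for container in containers:
        requests = container.get("requests", {})
        limits = container.get("limits", {})
        for resource in ("cpu", "memory"):
            req = requests.get(resource)
            lim = limits.get(resource)
            if req is not None or lim is not None:
                any_set = True
            # Guaranteed needs a limit for both resources; a missing
            # request defaults to the limit, an explicit one must match it.
            if lim is None or (req is not None and req != lim):
                guaranteed = False

    if not any_set:
        return "BestEffort"
    return "Guaranteed" if guaranteed else "Burstable"
```

For example, an agent whose only container sets equal CPU and memory requests and limits would be classified as Guaranteed, one that sets only a CPU request would be Burstable, and one that declares nothing would be BestEffort.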
