Agent scheduling is the process by which an orchestration system assigns an agent instance to a specific compute node or host machine. The decision takes into account placement constraints, the agent's resource requirements (CPU, memory, GPU), and declarative affinity or anti-affinity rules. Among the placements that satisfy the defined policies, the scheduler optimizes for system-wide objectives such as balanced load, low latency, low cost, and high availability.
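
As a rough illustration of the definition above, the sketch below uses a filter-then-score approach, which is one common way to structure a scheduler and not the only one. All names here (`Node`, `AgentSpec`, `feasible`, `score`, `schedule`) are hypothetical, and the scoring rule (prefer the node with the most headroom left after placement) is an assumed stand-in for the load-balancing objective.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Node:
    # Hypothetical node model: total capacity plus what is currently free.
    name: str
    cpu_total: float
    mem_total: float
    cpu_free: float
    mem_free: float
    gpu_free: int = 0
    labels: dict = field(default_factory=dict)
    groups: set = field(default_factory=set)  # agent groups already placed here


@dataclass
class AgentSpec:
    # Hypothetical agent request: resources plus affinity rules.
    name: str
    group: str
    cpu: float
    mem: float
    gpu: int = 0
    node_affinity: dict = field(default_factory=dict)        # labels the node must carry
    anti_affinity_groups: set = field(default_factory=set)   # groups to avoid sharing a node with


def feasible(agent: AgentSpec, node: Node) -> bool:
    """Filter step: hard constraints that a node must satisfy."""
    if node.cpu_free < agent.cpu or node.mem_free < agent.mem or node.gpu_free < agent.gpu:
        return False
    if any(node.labels.get(k) != v for k, v in agent.node_affinity.items()):
        return False
    if node.groups & agent.anti_affinity_groups:
        return False
    return True


def score(agent: AgentSpec, node: Node) -> float:
    """Score step (assumed objective): fraction of CPU and memory left after placement."""
    return ((node.cpu_free - agent.cpu) / node.cpu_total
            + (node.mem_free - agent.mem) / node.mem_total)


def schedule(agent: AgentSpec, nodes: list) -> Optional[Node]:
    """Pick the best feasible node and reserve its resources, or return None."""
    candidates = [n for n in nodes if feasible(agent, n)]
    if not candidates:
        return None
    best = max(candidates, key=lambda n: score(agent, n))
    best.cpu_free -= agent.cpu
    best.mem_free -= agent.mem
    best.gpu_free -= agent.gpu
    best.groups.add(agent.group)
    return best
```

With two nodes in the same zone, an anti-affinity rule against the agent's own group places two replicas on different nodes, which is how such a rule supports high availability. A third replica with the same spec would find no feasible node, and `schedule` would return `None`.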
