Context window saturation occurs when a model's token limit is completely filled, blocking the ingestion of new input. This forces a trade-off: to add fresh data, existing context must be evicted, truncated, or summarized, which can degrade performance by removing relevant information or increasing computational overhead. In agentic workflows, saturation disrupts stateful reasoning by breaking the continuity of multi-turn dialogue or long-form task execution.
