A context eviction policy is a deterministic rule set that governs which pieces of cached information are removed first when a system's allocated memory or token budget is exhausted. In systems like a transformer's KV Cache or an agent's working memory buffer, these policies—such as Least Recently Used (LRU) or First-In-First-Out (FIFO)—automatically select data for eviction to make room for new inputs, directly impacting reasoning continuity and computational efficiency.
