Multi-turn context is the accumulated sequence of user inputs, model responses, and system instructions across an entire conversational session, which must be managed within the model's fixed token limit to maintain coherence and state. This sequential history forms the model's working memory, allowing it to reference prior exchanges, follow instructions, and exhibit consistent personality. Effective management is critical for agentic workflows, where an autonomous system must track goals, actions, and outcomes over many steps without losing critical information.
