A transition model, also known as a dynamics model, is a learned function that predicts the next state of an environment given the current state and an action taken by an agent. It serves as an internal simulation of the world's dynamics, enabling the agent to plan and evaluate sequences of actions without costly real-world trial and error. This model is central to achieving sample efficiency, a primary advantage of model-based over model-free reinforcement learning.
