Compounding error is the phenomenon where small inaccuracies in a learned transition model are amplified over the course of a multi-step imagined rollout. Each step's prediction error becomes the input for the next, causing the simulated state to diverge increasingly from the trajectory that would occur in the real environment. This leads the agent's planning process to optimize for unrealistic futures, ultimately degrading the performance of the deployed policy.
