A planning horizon is the finite number of future time steps an agent considers when simulating action sequences using its internal world model. It defines the trade-off between computational cost and decision quality, as a longer horizon allows for more strategic, long-term planning but requires exponentially more simulation. In algorithms like Model Predictive Control (MPC) or Monte Carlo Tree Search (MCTS), the horizon is a critical hyperparameter that directly controls the agent's foresight.
