Trajectory optimization is a planning method that searches for a sequence of actions (a trajectory) that minimizes a defined cost function or maximizes cumulative reward over a finite time horizon, subject to a model of the system's dynamics. It treats planning as a numerical optimization problem, finding the most efficient path from an initial state to a goal state according to the model's predictions. This is a fundamental technique in model-based reinforcement learning (MBRL) and optimal control for tasks like robotics and autonomous systems.
