Glossary

Optimal Control Problem (OCP)

An Optimal Control Problem (OCP) is the mathematical formulation at the core of MPC, defined by a cost function to be minimized, a dynamic model describing the system's evolution, and a set of constraints on states and inputs, all over a specified time horizon.

Get in touch Learn more

ML engineer managing model versions on laptop, version history visible, technical Git-like workflow.

CORE FORMULATION

What is an Optimal Control Problem (OCP)?

The Optimal Control Problem (OCP) is the foundational mathematical framework for determining the best possible sequence of actions to steer a dynamic system towards a goal.

An Optimal Control Problem (OCP) is a mathematical optimization framework that determines a sequence of control inputs to minimize a cost function while respecting a dynamic model and constraints on states and inputs over a defined time horizon. It is the core computational problem solved at each step in Model Predictive Control (MPC). The solution is a trajectory of optimal actions that transitions the system from an initial state to a desired objective.

The formal OCP definition comprises four key elements: a dynamic model (e.g., differential equations) predicting state evolution; a cost function quantifying performance goals like tracking error or energy use; path constraints on admissible states and control inputs; and boundary conditions for initial and final states. Solving this constrained optimization yields an optimal control policy, providing the theoretical basis for advanced, predictive automation in robotics, process control, and autonomous systems.

MATHEMATICAL FORMULATION

Core Components of an OCP

An Optimal Control Problem (OCP) is the mathematical core of Model Predictive Control (MPC), defined by a dynamic model, a cost function, and constraints, all optimized over a finite time horizon.

Dynamic Model (System Equations)

The dynamic model is a set of mathematical equations, typically ordinary differential equations (ODEs) or difference equations, that describe how the system's state variables evolve over time in response to control inputs and disturbances. This internal model is the predictive engine of the OCP.

Continuous-time: dx/dt = f(x(t), u(t), p)
Discrete-time: x_{k+1} = f(x_k, u_k, p)

Where x is the state, u is the control input, and p are parameters. The fidelity of this model directly determines the accuracy of the controller's predictions.

Cost Function (Objective)

The cost function (or objective function) J is a scalar quantity that the OCP seeks to minimize. It mathematically encodes the control performance goals over the prediction horizon N.

A common quadratic form for tracking is: J = Σ (x_k - x_ref)^T Q (x_k - x_ref) + (u_k - u_ref)^T R (u_k - u_ref)

The first term penalizes deviation from a desired state reference trajectory.
The second term penalizes excessive control effort.
The matrices Q and R are tuning weights that balance these competing objectives.

State & Input Constraints

Constraints are mathematical inequalities that define the admissible operating region of the physical system. Explicitly handling these is a primary advantage of MPC.

State Constraints: x_min ≤ x(t) ≤ x_max (e.g., position limits, tank levels).
Input Constraints: u_min ≤ u(t) ≤ u_max (e.g., actuator saturation, valve limits).
Mixed Constraints: May involve both states and inputs.

Constraints can be hard (must never be violated) or soft (allowed violation with a penalty via slack variables to ensure the optimization problem remains feasible).

Optimization Horizon

The OCP is solved over a finite optimization horizon, which consists of two key intervals:

Prediction Horizon (N_p): The future time window over which the system's behavior is predicted using the dynamic model.
Control Horizon (N_c): The (often shorter) window over which control moves are optimized. Control inputs are typically held constant for steps beyond N_c.

The principle of Receding Horizon Control is applied: only the first control input from the optimized sequence is applied to the plant. The horizon then "shifts" forward one step, and the OCP is re-solved with new state measurements, providing feedback and disturbance rejection.

Initial Condition

The initial condition x_0 is the estimated or measured state of the system at the start of the current optimization window. It serves as the starting point for all predictions made by the dynamic model.

In practice, the full state is often not directly measurable. Therefore, a state estimator (like a Kalman Filter or Moving Horizon Estimator (MHE)) is used to reconstruct x_0 from available sensor outputs. The accuracy of this initial condition is critical for prediction quality.

Numerical Solver

The OCP is a constrained optimization problem that must be solved numerically at each control step. The choice of solver depends on the problem structure:

Linear/Quadratic MPC: Results in a Quadratic Program (QP). Solved efficiently with active-set or interior-point methods.
Nonlinear MPC (NMPC): Results in a Nonlinear Program (NLP). Solved using Sequential Quadratic Programming (SQP) or interior-point methods.

Real-Time Optimization (RTO) demands that the solver converges to a solution within the controller's sampling period (often milliseconds). Techniques like warm-starting (using the previous solution as an initial guess) are essential for meeting this timing requirement.

MATHEMATICAL FOUNDATION

How Optimal Control Problems Work

An Optimal Control Problem (OCP) is the core mathematical framework for determining the best sequence of actions to steer a dynamic system toward a goal while respecting physical and operational limits.

An Optimal Control Problem (OCP) is a mathematical formulation that defines how to control a dynamic system over time to minimize a cost function while satisfying a dynamic model and a set of state and input constraints. It provides the formal structure for Model Predictive Control (MPC), where this optimization is solved repeatedly in real-time. The problem is defined over a finite prediction horizon, balancing objectives like tracking accuracy, energy use, and safety.

Solving an OCP involves computing a sequence of future control inputs. For linear dynamics with a quadratic cost, this becomes a tractable Quadratic Programming (QP) problem. For nonlinear systems, it transforms into a more complex Nonlinear Programming (NLP). Key design elements include terminal costs and constraints to ensure stability, and techniques like warm-starting to accelerate online computation. The solution dictates the immediate actuator command before the horizon recedes and the process repeats.

FROM THEORY TO ACTUATION

Real-World OCP Applications

The Optimal Control Problem (OCP) is not an abstract exercise; it is the computational engine powering autonomous systems that interact with the physical world. These applications solve the core OCP—minimizing a cost subject to dynamics and constraints—in real-time to achieve precise, safe, and efficient control.

Autonomous Vehicle Trajectory Planning

Self-driving cars solve a high-speed OCP at every timestep. The cost function penalizes deviation from a lane center, excessive acceleration (for passenger comfort), and jerk. The dynamic model is the vehicle's kinematics (position, velocity, heading). Hard constraints ensure the vehicle stays within road boundaries and avoids collision polygons of other actors, while input constraints represent steering angle and acceleration limits. The solved trajectory is a smooth, collision-free path executed by lower-level controllers.

EXPLORE

Industrial Robot Arm Manipulation

For a robotic arm placing a component on a circuit board, the OCP computes the optimal joint torques. Key elements include:

Cost: Minimize time-to-completion and energy consumption.
Dynamics: Multi-body rigid dynamics equations (from Lagrangian or Newton-Euler methods).
State Constraints: Joint angle limits to prevent self-collision.
Input Constraints: Maximum torque output of each servo motor.
Path Constraints: The end-effector must follow a precise spatial path to insert the component correctly. This results in a smooth, time-optimal motion profile.

EXPLORE

Quadrotor Drone Stabilization & Acrobatics

Drones are inherently unstable and require millisecond-rate OCP solving. The model is the nonlinear dynamics of six degrees of freedom (position & orientation) under forces from four rotors. The cost heavily penalizes roll and pitch errors to maintain level flight. For aggressive maneuvers like flips, the cost switches to track a dynamic orientation trajectory. Constraints are critical: rotor speeds have strict minimum and maximum bounds (thrust limits), and the attitude must avoid singularities. This application highlights the need for fast, reliable Nonlinear MPC (NMPC) solvers.

EXPLORE

Chemical Process Control (Distillation Column)

In a refinery, a distillation column separates hydrocarbons. The OCP here is often Economic MPC, where the cost function directly maximizes profit. The dynamic model consists of complex mass and energy balance differential equations. State constraints keep temperature and pressure within safe operating windows to prevent runaway reactions. Input constraints limit valve opening rates. The controller continuously adjusts heat input and reflux flow to optimize for product purity and throughput while respecting all safety and operational limits, demonstrating OCP's role in high-stakes, continuous processes.

EXPLORE

Energy Management in Hybrid Electric Vehicles

The powertrain OCP decides how to split power demand between the internal combustion engine and the electric battery. The cost function minimizes total fuel consumption over a trip horizon. The state includes battery state-of-charge (SOC). A critical state constraint is that the final SOC must not drop below a threshold (e.g., 20%). Input constraints define the power limits of each source. The controller predicts future power demand based on route data and solves for the optimal blend, often pre-computing solutions via Explicit MPC for deployment on vehicle ECUs.

EXPLORE

Rocket Landing Guidance (Powered Descent)

This is a quintessential, high-stakes OCP. The goal is to land a rocket on a pad with near-zero velocity. The cost minimizes fuel consumption (a.k.a., maximum final mass). The dynamics are the rocket's translational motion under gravity and variable thrust. Control constraints are severe: thrust magnitude and direction have strict bounds. Path constraints ensure the vehicle avoids the ground until the final moment. The solution is a fuel-optimal thrust profile, often computed in real-time using convexification techniques to handle the non-convex thrust-bound constraints, showcasing OCP's role in aerospace guidance.

EXPLORE

CLASSIFICATION

Types of Optimal Control Problems

A comparison of the primary mathematical formulations for Optimal Control Problems (OCPs), distinguished by their model dynamics, cost function structure, and the nature of constraints.

Problem Feature	Linear-Quadratic Regulator (LQR)	Nonlinear Optimal Control (NOCP)	Mixed-Integer Optimal Control (MIOCP)	Stochastic Optimal Control (SOCP)
Core Dynamic Model	Linear Time-Invariant (LTI)	Nonlinear Differential Equations	Hybrid (Continuous & Discrete)	Stochastic Differential Equations
Typical Cost Function	Quadratic (State & Input)	General Nonlinear	General (with integer variables)	Expectation of a cost
Primary Constraint Type	Unconstrained / Linear	Nonlinear Equality/Inequality	Logic & Integer Constraints	Chance Constraints
Typical Solver Class	Algebraic Riccati Equation (ARE)	Nonlinear Programming (NLP)	Mixed-Integer Programming (MIP)	Dynamic Programming / Convex Approx.
Online Computation	Explicit (Pre-computed)	Numerical (Online Optimization)	Numerical (Online, NP-Hard)	Numerical (Scenario-Based)
Primary Application	Setpoint Regulation	Robotics, Chemical Processes	Systems with Discrete Modes	Finance, Noisy Systems
Stability Guarantees	Global (Nominal)	Local (Requires Design)	Problem-Dependent	Probabilistic
Handles State/Input Limits

OPTIMAL CONTROL PROBLEM (OCP)

Frequently Asked Questions

An Optimal Control Problem (OCP) is the core mathematical formulation for Model Predictive Control (MPC). It defines the objective, dynamics, and limits a controller must respect to achieve optimal performance. These FAQs address its components, solution methods, and role in robotics and automation.

An Optimal Control Problem (OCP) is a mathematical framework for finding a sequence of control inputs that drives a dynamic system from an initial state to a desired state while minimizing a cost function and satisfying constraints on states and inputs over a specified time horizon. It is the fundamental optimization solved at each time step in Model Predictive Control (MPC). The OCP is formally defined by three core elements: a dynamic model (e.g., differential or difference equations) that predicts future states, a cost function (or objective function) to be minimized (e.g., tracking error + control effort), and a set of constraints that define operational limits (e.g., actuator saturation, safety boundaries).

Enabling Efficiency, Speed & Accuracy

Intelligent Analysis, Decision & Execution

We build AI systems for teams that need search across company data, workflow automation across tools, or AI features inside products and internal software.

Talk to Us

Search across company data

Give teams answers from docs, tickets, runbooks, and product data with sources and permissions.

Useful when people spend too long searching or get different answers from different systems.

Enterprise searchRAGPermissions

Automate internal workflows

Use AI to route work, draft outputs, trigger actions, and keep approvals and logs in place.

Useful when repetitive work moves across multiple tools and teams.

AI agentsWorkflow automationGovernance

Add AI to products and internal tools

Build assistants, guided actions, or decision support into the software your team or customers already use.

Useful when AI needs to be part of the product, not a separate tool.

AI integrationDecision supportModel routing

OPTIMAL CONTROL PROBLEM (OCP)

Related Terms

The Optimal Control Problem (OCP) is the mathematical core of Model Predictive Control. These related concepts define its components, solution methods, and applications.

Cost Function (Objective Function)

The cost function is the scalar mathematical expression that an Optimal Control Problem (OCP) seeks to minimize. It quantifies the performance objective over the prediction horizon.

Typical Form: Often a quadratic function penalizing tracking error (deviation from a reference) and control effort.
Economic MPC: In advanced applications, the cost can directly represent economic objectives like energy consumption or profit.
Terminal Cost: An additional term evaluated at the final predicted state, used as a stability-enforcing design tool.

State & Input Constraints

Constraints are mathematical inequalities that define the admissible operating region of the system, explicitly enforced within the OCP optimization.

State Constraints: Limits on system variables like position, velocity, or temperature (e.g., x_min ≤ x(t) ≤ x_max).
Input Constraints: Physical limits on actuator commands, such as torque saturation or valve position limits.
Hard vs. Soft: Hard constraints must never be violated. Soft constraints use slack variables to allow minor, penalized violations, ensuring the optimization problem remains feasible.

Dynamic Model

The dynamic model is the set of differential or difference equations that describes how the system's state evolves over time in response to control inputs. It is the internal predictor within the OCP.

Forms: Can be linear time-invariant (LTI), nonlinear, or hybrid.
Fidelity: Model accuracy directly impacts MPC performance. System identification techniques are used to derive models from data.
Example: For a drone, the model encodes the rigid-body dynamics relating rotor thrusts to position, velocity, and orientation.

Receding Horizon Control

Receding Horizon Control is the real-time execution policy of MPC. At each control interval, the OCP is solved over a finite future window, but only the first control input is applied to the physical system.

Principle: After application, the horizon "recedes" forward by one time step, new state measurements are taken, and the OCP is re-solved with updated initial conditions.
Feedback: This closed-loop approach provides inherent compensation for disturbances and model mismatch.
Core Advantage: It allows the controller to continuously adjust its plan based on the latest information.

Quadratic Programming (QP) Solver

A Quadratic Programming (QP) solver is the numerical optimization engine at the heart of Linear MPC. It efficiently solves the convex optimization problem generated at each time step.

Problem Structure: Minimizes a quadratic cost function subject to linear equality and inequality constraints.
Real-Time Requirement: Must compute a solution within the system's sampling period (e.g., milliseconds).
Common Algorithms: Active-set methods and interior-point methods are standard. For embedded systems, highly optimized, code-generated solvers like qpOASES or OSQP are used.

Nonlinear MPC (NMPC)

Nonlinear MPC is the generalization of MPC where the OCP involves a nonlinear dynamic model and/or a non-quadratic cost function, resulting in a Nonlinear Programming (NLP) problem.

Application: Essential for systems with significant nonlinear dynamics (e.g., chemical reactors, agile robotics).
Computational Challenge: The NLP is non-convex and more computationally intensive. Sequential Quadratic Programming (SQP) and Interior-Point methods are common solution approaches.
Tools: Frameworks like ACADO Toolkit and CasADi are used to model and solve NMPC problems efficiently.

About the author

Prasad Kumkar

CEO & MD, Inference Systems

Prasad Kumkar is the CEO & MD of Inference Systems and writes about AI systems architecture, LLM infrastructure, model serving, evaluation, and production deployment. Over 5+ years, he has worked across computer vision models, L5 autonomous vehicle systems, and LLM research, with a focus on taking complex AI ideas into real-world engineering systems.

His work and writing cover AI systems, large language models, AI agents, multimodal systems, autonomous systems, inference optimization, RAG, evaluation, and production AI engineering.

Limited slotsGet a Free AI Consultation

How We Work

Custom AI workflows for your Business

One-fit-all AI don't work for modern businesses. At Inferensys, we aim to understand your business & custom requirements; which we use to define most efficient agentic workflows, the data, and the tools for your business.

Talk to Us

Optimal Control Problem (OCP)

What is an Optimal Control Problem (OCP)?

Core Components of an OCP

Dynamic Model (System Equations)

Cost Function (Objective)

State & Input Constraints

Optimization Horizon

Initial Condition

Numerical Solver

How Optimal Control Problems Work

Real-World OCP Applications

Autonomous Vehicle Trajectory Planning

Industrial Robot Arm Manipulation

Quadrotor Drone Stabilization & Acrobatics

Chemical Process Control (Distillation Column)

Energy Management in Hybrid Electric Vehicles

Rocket Landing Guidance (Powered Descent)

Types of Optimal Control Problems

Frequently Asked Questions

Intelligent Analysis, Decision & Execution

Search across company data

Automate internal workflows

Add AI to products and internal tools

Prasad Kumkar

Partnered with leading AI, data, and software stack.

Custom AI workflows for your Business

Review the use case

Pick the right approach

Build the first useful version

Improve from there