Imitation learning is a machine learning paradigm where an agent learns a policy by directly mimicking expert demonstrations, bypassing the need to design a complex reward function. The core assumption is that the provided demonstrations represent near-optimal behavior. This approach is highly sample-efficient for complex tasks where specifying a reward is difficult, such as autonomous driving or robotic manipulation. It is closely related to supervised learning, where the state-action pairs from the expert become the training dataset.
