Population Based Training (PBT) is an asynchronous optimization algorithm that merges hyperparameter optimization with model training. Instead of training models independently, PBT maintains a population of models that are trained in parallel. Periodically, it evaluates the population, allows the best-performing models (the exploit step) to pass their parameters and hyperparameters to underperforming ones, and then randomly perturbs the hyperparameters of the copied models (the explore step). This creates a continuous, online search where training and hyperparameter tuning are unified into a single process.
