A probabilistic ensemble is a set of multiple neural networks, each trained independently on the same dataset to predict an environment's dynamics, where the statistical disagreement among the ensemble members is used to estimate predictive uncertainty. This quantified uncertainty is critical for robust planning and directed exploration, as it allows an agent to distinguish between reliable and unreliable model predictions. In model-based reinforcement learning (MBRL), this technique directly addresses the challenge of model error and compounding error in imagined rollouts.
