Dirichlet noise is a random perturbation sampled from a Dirichlet distribution and added to the prior probabilities at the root node of a Monte Carlo Tree Search. This injection of stochasticity, as used in systems like AlphaZero, encourages the search algorithm to explore a broader range of initial actions during the early phases of self-play games, preventing premature convergence on a narrow set of moves. It directly addresses the exploration-exploitation tradeoff by ensuring the root's children are visited even if their initial neural network priors are low.
