Rapid Action Value Estimation (RAVE) is a heuristic technique that modifies the backpropagation phase of Monte Carlo Tree Search (MCTS). Instead of updating statistics only for nodes on the specific path taken during a simulation, RAVE also updates a separate, global statistic for each unique action played anywhere in that simulation. This creates an all-moves-as-first (AMAF) heuristic, providing a faster, more sample-efficient estimate of an action's general value, which is particularly beneficial in the early stages of search when node visit counts are low.
