Optional Stopping Theory provides conditions under which the expected value of a stochastic process at a random stopping time equals its initial expected value. It is used to design principled rules for terminating a process, such as pruning suboptimal search paths in LLM decoding.
Optional Stopping Theory is a mathematical concept that helps determine when to stop a process that unfolds over time, especially when dealing with uncertainty. It provides a rigorous way to make stopping decisions, ensuring that the expected outcome remains consistent. In AI, it's used to efficiently prune less promising search paths in complex tasks like language model reasoning.
Optional Stopping Theorem, Doob's Optional Stopping Theorem
Was this definition helpful?