Adaptive Decoding via Test-Time Policy Learning for Self-Improving Generation | ScienceToStartup | ScienceToStartup