A Diffusion Analysis of Policy Gradient for Stochastic Bandits | ScienceToStartup