Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models | ScienceToStartup | ScienceToStartup