Reaching Beyond the Mode: RL for Distributional Reasoning in Language Models | Signal Canvas | ScienceToStartup