Group Distributionally Robust Optimization-Driven Reinforcement Learning for LLM Reasoning | ScienceToStartup | ScienceToStartup