StaRPO: Stability-Augmented Reinforcement Policy Optimization | ScienceToStartup