Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks | Signal Canvas | ScienceToStartup