Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning | ScienceToStartup | ScienceToStartup