How can reinforcement learning be applied to optimize multi- | ScienceToStartup | ScienceToStartup