How can reinforcement learning be applied to optimize multi-robot coordination with user guidance?Answer not yet generated.