TSR: Trajectory-Search Rollouts for Multi-Turn RL of LLM Agents | Signal Canvas | ScienceToStartup