TSR: Trajectory-Search Rollouts for Multi-Turn RL of LLM Agents | ScienceToStartup