EvolveTool-Bench: Evaluating the Quality of LLM-Generated Tool Libraries as Software Artifacts | ScienceToStartup | ScienceToStartup