Reproducible, Explainable, and Effective Evaluations of Agentic AI for Software Engineering | ScienceToStartup