How does scenario diversity in AI benchmarking contribute to | ScienceToStartup