Measuring AI Agents' Progress on Multi-Step Cyber Attack Scenarios | ScienceToStartup | ScienceToStartup