Ranking Reasoning LLMs under Test-Time Scaling | ScienceToStartup | ScienceToStartup