Watson & Holmes: A Naturalistic Benchmark for Comparing Human and LLM Reasoning | ScienceToStartup