VLM-SubtleBench: How Far Are VLMs from Human-Level Subtle Comparative Reasoning? | ScienceToStartup | ScienceToStartup