Understanding the Fine-Grained Knowledge Capabilities of Vision-Language Models | ScienceToStartup | ScienceToStartup