How can vision language models enhance overall performance a | ScienceToStartup | ScienceToStartup