What are the implications of vision-language models for the | ScienceToStartup