What are the limitations of current vision language models i | ScienceToStartup