State of Vision Language Models | Report | ScienceToStartup