Vision-Language Models Unlock Task-Centric Latent Actions | ScienceToStartup