Visually-Guided Policy Optimization for Multimodal Reasoning | ScienceToStartup