Discovering Failure Modes in Vision-Language Models using RL | ScienceToStartup