Visual Preference Optimization with Rubric Rewards | ScienceToStartup