Rationale Matters: Learning Transferable Rubrics via Proxy-Guided Critique for VLMReward Models | ScienceToStartup | ScienceToStartup