How does Contrast-Driven Rubric Reward Model improve data ef | ScienceToStartup