MARS: Margin-Aware Reward-Modeling with Self-Refinement | ScienceToStartup | ScienceToStartup