ARM: Advantage Reward Modeling for Long-Horizon Manipulation | ScienceToStartup