HIPO: Instruction Hierarchy via Constrained Reinforcement Learning | ScienceToStartup | ScienceToStartup