SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions | ScienceToStartup