Reinforcement Learning with Verifiable Rewards (RLVR) | Glossary | ScienceToStartup