Post-Training Local LLM Agents for Linux Privilege Escalation with Verifiable Rewards | ScienceToStartup | ScienceToStartup