Amplification Effects in Test-Time Reinforcement Learning: Safety and Reasoning Vulnerabilities | ScienceToStartup | ScienceToStartup