Writer-R1: Enhancing Generative Writing in LLMs via Memory-augmented Replay Policy Optimization | ScienceToStartup | ScienceToStartup