Reference-guided Policy Optimization for Molecular Optimization via LLM Reasoning | ScienceToStartup | ScienceToStartup