RAPO: Risk-Aware Preference Optimization for Generalizable Safe Reasoning | ScienceToStartup | ScienceToStartup