Alignment-Weighted DPO: A principled reasoning approach to improve safety alignment | ScienceToStartup | ScienceToStartup