Alignment-Weighted DPO: A principled reasoning approach to improve safety alignment | Signal Canvas | ScienceToStartup