Mitigating Mismatch within Reference-based Preference Optimization | ScienceToStartup | ScienceToStartup