Reward Models Inherit Value Biases from Pretraining | ScienceToStartup | ScienceToStartup