Pessimistic Auxiliary Policy for Offline Reinforcement Learning | ScienceToStartup | ScienceToStartup