The Token Games: Evaluating Language Model Reasoning with Puzzle Duels | ScienceToStartup