Offline Exploration-Aware Fine-Tuning for Long-Chain Mathematical Reasoning | ScienceToStartup | ScienceToStartup