Pre-training LLM without Learning Rate Decay Enhances Supervised Fine-Tuning | ScienceToStartup