Deriving Hyperparameter Scaling Laws via Modern Optimization Theory | ScienceToStartup