Skip to main content
Large-Step Training Dynamics of a Two-Factor Linear Transformer Model | Buildability Receipt | ScienceToStartup