SLA2: Sparse-Linear Attention with Learnable Routing and QAT | ScienceToStartup | ScienceToStartup