SDFP: Speculative Decoding with FIT-Pruned Models for Training-Free and Plug-and-Play LLM Acceleration | Buildability Receipt | ScienceToStartup