How can knowledge distillation be applied to pre-trained LLM | ScienceToStartup