How do knowledge distillation frameworks decouple teacher and student architectures for improved LLM training efficiency?Answer not yet generated.