Dual-Space Knowledge Distillation with Key-Query Matching for Large Language Models with Vocabulary Mismatch | ScienceToStartup | ScienceToStartup