Adapting Text LLMs to Speech via Multimodal Depth Up-Scaling | ScienceToStartup | ScienceToStartup