BBPE16: UTF-16-based byte-level byte-pair encoding for improved multilingual speech recognition | ScienceToStartup | ScienceToStartup