subword tokenization | Glossary | ScienceToStartup