Hierarchical Pre-Training of Vision Encoders with Large Language Models | ScienceToStartup | ScienceToStartup