Cost-Efficient Multimodal LLM Inference via Cross-Tier GPU Heterogeneity | Signal Canvas | ScienceToStartup