Cost-Efficient Multimodal LLM Inference via Cross-Tier GPU Heterogeneity | ScienceToStartup | ScienceToStartup