Best-of-Q: Improving VLM agents with Q-function Action Ranking at Inference | ScienceToStartup