What are the research directions for enabling vision-language agents to perform multi-step reasoning?

Reviewed by ScienceToStartup Editorial
Updated 6/2/2026

Answer not yet generated.