How are multimodal foundation models benefiting from advancements in spatial reasoning?Answer not yet generated.