How are recent developments in spatial reasoning impacting multimodal foundation models?Answer not yet generated.