How do datasets isolating spatial reasoning from visual input reveal model strengths and weaknesses?Answer not yet generated.