Perception-Aware Multimodal Spatial Reasoning from Monocular Images | ScienceToStartup | ScienceToStartup