Skip to main content
GAP-MLLM: Geometry-Aligned Pre-training for Activating 3D Spatial Perception in Multimodal Large Language Models | Signal Canvas | ScienceToStartup