3D Vision

TrendingProof pending

11papers

6.7viability

+100%30d

Proof pending

Proof pending. Core topic summary fields are still materializing.

State of the Field

3D vision is advancing rapidly, particularly in applications such as autonomous driving, robotics, and environmental modeling. Recent research focuses on improving point cloud registration, spatial reasoning, and feature matching, addressing challenges like noise and occlusions. Techniques like IGASA enhance registration accuracy through multi-scale feature extraction, while SpatialForge synthesizes spatial reasoning data from 2D images to bolster model performance. Innovations like LoMa and DecomPose refine local feature matching and object pose estimation, respectively, by leveraging large datasets and addressing optimization conflicts. These developments are crucial for builders aiming to implement robust 3D vision systems that can operate effectively in complex real-world environments, ensuring better performance and reliability in diverse applications.

Last updated Jun 6, 2026

Topic-linked question coverage is still building for this proof surface.

Topic trend

Topic-specific paper and score movement from the daily diff ledger.

Papers

1-10 of 11

Research Paper·Mar 13, 2026

IGASA: Integrated Geometry-Aware and Skip-Attention Modules for Enhanced Point Cloud Registration

Point cloud registration (PCR) is a fundamental task in 3D vision and provides essential support for applications such as autonomous driving, robotics, and environmental modeling. Despite its widespre...

8.0 viability

Research Paper·May 12, 2026

SpatialForge: Bootstrapping 3D-Aware Spatial Reasoning from Open-World 2D Images

Recent advancements in Large Vision-Language Models (VLMs) have demonstrated exceptional semantic understanding, yet these models consistently struggle with spatial reasoning, often failing at fundame...

7.0 viability

Research Paper·Apr 6, 2026

LoMa: Local Feature Matching Revisited

Local feature matching has long been a fundamental component of 3D vision systems such as Structure-from-Motion (SfM), yet progress has lagged behind the rapid advances of modern data-driven approache...

7.0 viabilityHas code

Research Paper·May 15, 2026·B2BConsumer

DecomPose: Disentangling Cross-Category Optimization Contention for Category-Level 6D Object Pose Estimation

Category-level 6D object pose estimation is typically formulated as a multi-category joint learning problem with fully shared model parameters. However, pronounced geometric heterogeneity across categ...

7.0 viability

Research Paper·May 27, 2026·B2BMedia & Entertainment

SSR3D-LLM: Structured Spatial Reasoning via Latent Steps for Fine-Grained Grounding in Unified 3D-LLMs

3D object grounding localizes referred objects in a 3D scene from natural language. Unified instance-centric 3D-LLMs aim to solve grounding together with dialog, QA, and captioning, yet many rely on a...

7.0 viability

Research Paper·Mar 26, 2026

AirSplat: Alignment and Rating for Robust Feed-Forward 3D Gaussian Splatting

While 3D Vision Foundation Models (3DVFMs) have demonstrated remarkable zero-shot capabilities in visual geometry estimation, their direct application to generalizable novel view synthesis (NVS) remai...

7.0 viability

Research Paper·Mar 25, 2026

SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision

3D Gaussian Splatting (3DGS) enables real-time, photorealistic novel view synthesis, making it a highly attractive representation for model-based video tracking. However, leveraging the differentiabil...

7.0 viability

Research Paper·Mar 5, 2026

Dark3R: Learning Structure from Motion in the Dark

We introduce Dark3R, a framework for structure from motion in the dark that operates directly on raw images with signal-to-noise ratios (SNRs) below $-4$ dB -- a regime where conventional feature- and...

7.0 viability

Research Paper·May 12, 2026

PointGS: Semantic-Consistent Unsupervised 3D Point Cloud Segmentation with 3D Gaussian Splatting

Unsupervised point cloud segmentation is critical for embodied artificial intelligence and autonomous driving, as it mitigates the prohibitive cost of dense point-level annotations required by fully s...

7.0 viabilityHas code

Research Paper·Feb 26, 2026

SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs

3D Large Vision-Language Models (3D LVLMs) built upon Large Language Models (LLMs) have achieved remarkable progress across various multimodal tasks. However, their inherited position-dependent modeli...

5.0 viability

Page 1 of 2

3D Vision

Proof pending

State of the Field

Topic trend

Papers

IGASA: Integrated Geometry-Aware and Skip-Attention Modules for Enhanced Point Cloud Registration

SpatialForge: Bootstrapping 3D-Aware Spatial Reasoning from Open-World 2D Images

LoMa: Local Feature Matching Revisited

DecomPose: Disentangling Cross-Category Optimization Contention for Category-Level 6D Object Pose Estimation

SSR3D-LLM: Structured Spatial Reasoning via Latent Steps for Fine-Grained Grounding in Unified 3D-LLMs

AirSplat: Alignment and Rating for Robust Feed-Forward 3D Gaussian Splatting

SpectralSplats: Robust Differentiable Tracking via Spectral Moment Supervision

Dark3R: Learning Structure from Motion in the Dark

PointGS: Semantic-Consistent Unsupervised 3D Point Cloud Segmentation with 3D Gaussian Splatting

SoPE: Spherical Coordinate-Based Positional Embedding for Enhancing Spatial Perception of 3D LVLMs

Filters

Topic proof surfaces

3D Vision

Use this topic page as a durable research-area proof surface