ARXIV:2603.26658 · COMPUTER VISION · SUBMITTED 30 MAR · 20:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: partial proof status

Zero-Shot Depth from Defocus

Yiming Zuo · Hongyu Wen · Venkat Subramanian · Patrick Chen · Karhan Kayan · Mario Bijelic · +2 at arXiv

A novel Transformer-based architecture for zero-shot depth estimation from focus stacks, achieving significant performance improvements on a new real-world benchmark.

Ship in 2-4 weeks›Score7.0Evidence partial

Opportunity summary

Pain A novel Transformer-based architecture for zero-shot depth estimation from focus stacks, achieving significant performance improvements on a new real-world benchmark.

Evidence 62 refs | 4 sources | 83% coverage

Blocker Evidence partial

Open Build Read PDF Signal Canvas Track

PROBLEM

A novel Transformer-based architecture for zero-shot depth estimation from focus stacks, achieving significant performance improvements on a new real-world benchmark. Unlike previous works overfitting to a certain dataset, this paper focuses on the challenging…

METHOD

Full abstract

Depth from Defocus (DfD) is the task of estimating a dense metric depth map from a focus stack. Unlike previous works overfitting to a certain dataset, this paper focuses on the challenging and practical setting of zero-shot generalization. We first propose a new real-world DfD benchmark ZEDD, which contains 8.3x more scenes and significantly higher quality images and ground-truth depth maps compared to previous benchmarks. We also design a novel network architecture named FOSSA. FOSSA is a Transformer-based architecture with novel designs tailored to the DfD task. The key contribution is a stack attention layer with a focus distance embedding, allowing efficient information exchange across the focus stack. Finally, we develop a new training data pipeline allowing us to utilize existing large-scale RGBD datasets to generate synthetic focus stacks. Experiment results on ZEDD and other benchmarks show a significant improvement over the baselines, reducing errors by up to 55.7%. The ZEDD benchmark is released at https://zedd.cs.princeton.edu. The code and checkpoints are released at https://github.com/princeton-vl/FOSSA.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Experiment results on ZEDD and other benchmarks show a significant improvement over the baselines, reducing errors by up to 55.7%. A public repository is…

WHY NOW

Computer Vision moved forward this cycle; last verified April 2026. Public score 7.0/10. Implementation evidence is present through a linked repository.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA novel Transformer-based architecture for zero-shot depth estimation from focus stacks, achieving significant performance improvements on a new real-world benchmark.

Evidence62 refs | 4 sources | 83% coverage

Blockerno shell-level blocker reported

Analysis summary

A novel Transformer-based architecture for zero-shot depth estimation from focus stacks, achieving significant performance improvements on a new real-world benchmark.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: partial proof status

Competitive landscape

A novel Transformer-based architecture for zero-shot depth estimation from focus stacks, achieving significant performance improvements on a new real-world benchmark.

Segment

Computer Vision

Adoption evidence

Public code linked for build inspection

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "a1155b16-f036-4416-8829-86990cb1c73c", "arxiv_id": "2603.26658", "canonical_route": "/paper/zero-shot-depth-from-defocus", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "zero-shot-depth-from-defocus", "endpoints": { "paper_pack": "/api/v1/paper/zero-shot-depth-from-defocus/paper-pack", "build_passport": "/api/v1/paper/zero-shot-depth-from-defocus/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Zero-Shot Depth from Defocus", "normalized_query": "2603.26658", "route": "/paper/zero-shot-depth-from-defocus", "paper_ref": "zero-shot-depth-from-defocus", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/zero-shot-depth-from-defocus#webpage", "url": "https://sciencetostartup.com/paper/zero-shot-depth-from-defocus", "name": "Zero-Shot Depth from Defocus", "description": "A novel Transformer-based architecture for zero-shot depth estimation from focus stacks, achieving significant performance improvements on a new real-world benchmark.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/zero-shot-depth-from-defocus#scholarlyArticle", "headline": "Zero-Shot Depth from Defocus", "description": "A novel Transformer-based architecture for zero-shot depth estimation from focus stacks, achieving significant performance improvements on a new real-world benchmark.", "url": "https://sciencetostartup.com/paper/zero-shot-depth-from-defocus", "sameAs": "https://arxiv.org/abs/2603.26658", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.26658" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-27T17:56:26.000Z", "author": [ { "@type": "Person", "name": "Yiming Zuo" }, { "@type": "Person", "name": "Hongyu Wen" }, { "@type": "Person", "name": "Venkat Subramanian" }, { "@type": "Person", "name": "Patrick Chen" }, { "@type": "Person", "name": "Karhan Kayan" }, { "@type": "Person", "name": "Mario Bijelic" }, { "@type": "Person", "name": "Felix Heide" }, { "@type": "Person", "name": "Jia Deng" } ], "codeRepository": "https://github.com/princeton-vl/FOSSA", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Computer Vision" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code, repo url" } ] }, { "@type": "SoftwareSourceCode", "@id": "https://sciencetostartup.com/paper/zero-shot-depth-from-defocus#software", "name": "Zero-Shot Depth from Defocus - Source Code", "description": "A novel Transformer-based architecture for zero-shot depth estimation from focus stacks, achieving significant performance improvements on a new real-world benchmark.", "codeRepository": "https://github.com/princeton-vl/FOSSA", "url": "https://github.com/princeton-vl/FOSSA" }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Computer Vision", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Zero-Shot Depth from Defocus", "item": "https://sciencetostartup.com/paper/zero-shot-depth-from-defocus" } ] } ] }

Competitive landscape

A novel Transformer-based architecture for zero-shot depth estimation from focus stacks, achieving significant performance improvements on a new real-world benchmark.

Segment

Computer Vision

Adoption evidence

Public code linked for build inspection

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Zero-Shot Depth from Defocus

Zero-Shot Depth from Defocus

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline