ARXIV:2605.10865 · PROGRAMMATIC CAD · SUBMITTED 12 MAY · 20:14 UTC · FRESHNESS FRESH

Verified · Source: PDF linked · Verified · PaperPack: citation fields available · Partial · Proof: unverified proof status

BenchCAD: A Comprehensive, Industry-Standard Benchmark for Programmatic CAD

Haozhe Zhang · Kaichen Liu · Miaomiao Chen · Lei Li · Shaojie Yang · Cheng Peng · Hanjie Chen

BenchCAD is a comprehensive benchmark for industrial CAD code generation, revealing current model limitations in producing faithful parametric programs and enabling fine-grained analysis.

Ship in 2-4 weeks · Score 7.0 · Evidence unverified

Opportunity summary

Pain BenchCAD is a comprehensive benchmark for industrial CAD code generation, revealing current model limitations in producing faithful parametric programs and enabling fine-grained analysis.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified

PROBLEM

BenchCAD is a comprehensive benchmark for industrial CAD code generation, revealing current model limitations in producing faithful parametric programs and enabling fine-grained analysis. Beyond recognizing the outer shape of a part, this task involves…

METHOD

Full abstract

Industrial Computer-Aided Design (CAD) code generation requires models to produce executable parametric programs from visual or textual inputs. Beyond recognizing the outer shape of a part, this task involves understanding its 3D structure, inferring engineering parameters, and choosing CAD operations that reflect how the part would be designed and manufactured. Despite the promise of multimodal large language models (MLLMs) for this task, they are rarely evaluated on whether these capabilities jointly hold in realistic industrial CAD settings. We present BenchCAD, a unified benchmark for industrial CAD reasoning. BenchCAD contains 17,900 execution-verified CadQuery programs across 106 industrial part families, including bevel gears, compression springs, twist drills, and other reusable engineering designs. It evaluates models through visual question answering, code question answering, image-to-code generation, and instruction-guided code editing, enabling fine-grained analysis across perception, parametric abstraction, and executable program synthesis. Across 10+ frontier models, BenchCAD shows that current systems often recover coarse outer geometry but fail to produce faithful parametric CAD programs. Common failures include missing fine 3D structure, misinterpreting industrial design parameters, and replacing essential operations such as sweeps, lofts, and twist-extrudes with simpler sketch-and-extrude patterns. Fine-tuning and reinforcement learning improve in-distribution performance, but generalization to unseen part families remains limited. These results position BenchCAD as a benchmark for measuring and improving the industrial readiness of multimodal CAD automation.
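The failure mode described in the abstract, where a model swaps a sweep, loft, or twist-extrude for a plain extrude, can be illustrated with a small static check. The sketch below is not BenchCAD's scoring code; the helper names cad_ops and dropped_ops and the two toy programs are assumptions made for illustration. It only parses the program text with Python's ast module, so CadQuery does not need to be installed. The method names sweep, loft, and twistExtrude are CadQuery Workplane methods.

```python
import ast

# CadQuery Workplane methods matching the operations the abstract calls essential.
ADVANCED_OPS = {"sweep", "loft", "twistExtrude"}


def cad_ops(source: str) -> set[str]:
    """Return the names of all methods called anywhere in a program's source."""
    tree = ast.parse(source)
    return {
        node.func.attr
        for node in ast.walk(tree)
        if isinstance(node, ast.Call) and isinstance(node.func, ast.Attribute)
    }


def dropped_ops(reference: str, candidate: str) -> set[str]:
    """Advanced operations the reference uses but the candidate never calls."""
    return (cad_ops(reference) & ADVANCED_OPS) - cad_ops(candidate)


# Hypothetical reference program: a twisted bar built with twistExtrude.
reference = '''
import cadquery as cq
result = cq.Workplane("XY").rect(4, 1).twistExtrude(20, 180)
'''

# Hypothetical model output: same sketch, but a plain extrude loses the twist.
candidate = '''
import cadquery as cq
result = cq.Workplane("XY").rect(4, 1).extrude(20)
'''

print(dropped_ops(reference, candidate))  # {'twistExtrude'}
```

A check like this only looks at which operations appear in the code. It says nothing about whether the resulting solid is geometrically faithful, which would require executing both programs and comparing the shapes.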

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Across 10+ frontier models, BenchCAD shows that current systems often recover coarse outer geometry but fail to produce faithful parametric CAD programs. Code availability…

WHY NOW

Programmatic CAD moved forward this cycle; last verified May 2026. Public score 7.0/10. Production flags indicate code availability.

Opportunity summary

Score 7.0

Pain BenchCAD is a comprehensive benchmark for industrial CAD code generation, revealing current model limitations in producing faithful parametric programs and enabling fine-grained analysis.

Evidence 0 refs | 0 sources | 0% coverage

Blocker no shell-level blocker reported

Analysis summary

BenchCAD is a comprehensive benchmark for industrial CAD code generation, revealing current model limitations in producing faithful parametric programs and enabling fine-grained analysis.

Competitive landscape

BenchCAD is a comprehensive benchmark for industrial CAD code generation, revealing current model limitations in producing faithful parametric programs and enabling fine-grained analysis.

Segment: Programmatic CAD

Adoption evidence: No public code link in the paper record yet

Commercial read: 7.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified

Page: https://sciencetostartup.com/paper/benchcad-a-comprehensive-industry-standard-benchmark-for-programmatic-cad · arXiv: https://arxiv.org/abs/2605.10865 · Published 2026-05-11T17:13:36.000Z · Free to access

Authors: Haozhe Zhang · Kaichen Liu · Miaomiao Chen · Lei Li · Shaojie Yang · Cheng Peng · Hanjie Chen

Research domain: Programmatic CAD · Viability score: 7 · Commercial readiness: code
