ARXIV:2606.03303 · FORMAL MATHEMATICS LLMS · SUBMITTED 03 JUN · 20:41 UTC · FRESHNESS FRESH

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks

Po-Nien Kung · Linfeng Song · Dawsen Hwang · Jinsung Yoon · Chun-Liang Li · Simone Severini · +7 at arXiv

LEAP is an agentic framework that supercharges general LLMs to achieve state-of-the-art performance in formal mathematics theorem proving, solving all problems on the 2025 Putnam Competition.

Ship in 2-4 weeks›Score8.0Evidence unverified

Opportunity summary

Pain LEAP is an agentic framework that supercharges general LLMs to achieve state-of-the-art performance in formal mathematics theorem proving, solving all problems on the 2025 Putnam Competition.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

LEAP is an agentic framework that supercharges general LLMs to achieve state-of-the-art performance in formal mathematics theorem proving, solving all problems on the 2025 Putnam Competition. We present LEAP, an agentic framework that enables…

METHOD

Full abstract

Large Language Models (LLMs) exhibit strong informal mathematical reasoning but struggle to generate mechanically verifiable proofs in formal languages like Lean. We present LEAP, an agentic framework that enables general-purpose foundation models to achieve state-of-the-art performance on automated formal theorem proving. LEAP leverages foundation model capabilities, such as informal reasoning, instruction following, and iterative self-refinement. By decomposing complex problems into smaller units, the system bridges formal proof construction with informal blueprints through continuous interaction with the Lean compiler. To provide a rigorous evaluation beyond increasingly saturated benchmarks, we introduce Lean-IMO-Bench, a benchmark of IMO-style problems formalized in Lean, with short statements yet highly non-routine and multi-step proofs across a wide range of difficulty levels. Empirically, on the latest 2025 Putnam Competition, an annual mathematics competition for undergraduate students in North America, LEAP solves all 12 problems, matching recent breakthroughs by frontier formal mathematical models. On Lean-IMO-Bench, LEAP boosts the one-shot formal solve rate of general-purpose LLMs from below 10% to 70%, notably surpassing the 48% benchmark set by a specialized, gold-medal-caliber IMO system. Furthermore, we demonstrate LEAP's research-level utility by autonomously formalizing complex proofs for open combinatorial challenges, including a verified proof for a key subproblem in Knuth's Hamiltonian decomposition of even-order Cayley graphs.

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. We present LEAP, an agentic framework that enables general-purpose foundation models to achieve state-of-the-art performance on automated formal theorem proving. Code availability is flagged…

WHY NOW

Formal Mathematics LLMs moved forward this cycle; last verified June 2026. Public score 8.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score8.0

PainLEAP is an agentic framework that supercharges general LLMs to achieve state-of-the-art performance in formal mathematics theorem proving, solving all problems on the 2025 Putnam Competition.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

LEAP is an agentic framework that supercharges general LLMs to achieve state-of-the-art performance in formal mathematics theorem proving, solving all problems on the 2025 Putnam Competition.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks

Po-Nien Kung · Linfeng Song · Dawsen Hwang · Jinsung Yoon · Chun-Liang Li · Simone Severini · +7 at arXiv

LEAP is an agentic framework that supercharges general LLMs to achieve state-of-the-art performance in formal mathematics theorem proving, solving all problems on the 2025 Putnam Competition.

Competitive landscape

LEAP is an agentic framework that supercharges general LLMs to achieve state-of-the-art performance in formal mathematics theorem proving, solving all problems on the 2025 Putnam Competition.

Segment

Formal Mathematics LLMs

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "b019b432-8521-494e-b831-0765bd92f75d", "arxiv_id": "2606.03303", "canonical_route": "/paper/leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks", "endpoints": { "paper_pack": "/api/v1/paper/leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks/paper-pack", "build_passport": "/api/v1/paper/leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks", "normalized_query": "2606.03303", "route": "/paper/leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks", "paper_ref": "leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks#webpage", "url": "https://sciencetostartup.com/paper/leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks", "name": "LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks", "description": "LEAP is an agentic framework that supercharges general LLMs to achieve state-of-the-art performance in formal mathematics theorem proving, solving all problems on the 2025 Putnam Competition.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks#scholarlyArticle", "headline": "LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks", "description": "LEAP is an agentic framework that supercharges general LLMs to achieve state-of-the-art performance in formal mathematics theorem proving, solving all problems on the 2025 Putnam Competition.", "url": "https://sciencetostartup.com/paper/leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks", "sameAs": "https://arxiv.org/abs/2606.03303", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2606.03303" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-06-02T08:16:42.000Z", "author": [ { "@type": "Person", "name": "Po-Nien Kung" }, { "@type": "Person", "name": "Linfeng Song" }, { "@type": "Person", "name": "Dawsen Hwang" }, { "@type": "Person", "name": "Jinsung Yoon" }, { "@type": "Person", "name": "Chun-Liang Li" }, { "@type": "Person", "name": "Simone Severini" }, { "@type": "Person", "name": "Mirek Olšák" }, { "@type": "Person", "name": "Edward Lockhart" }, { "@type": "Person", "name": "Quoc V Le" }, { "@type": "Person", "name": "Burak Gokturk" }, { "@type": "Person", "name": "Thang Luong" }, { "@type": "Person", "name": "Tomas Pfister" }, { "@type": "Person", "name": "Nanyun Peng" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 8 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Formal Mathematics LLMs" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Formal Mathematics LLMs", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "LEAP: Supercharging LLMs for Formal Mathematics with Agentic", "item": "https://sciencetostartup.com/paper/leap-supercharging-llms-for-formal-mathematics-with-agentic-frameworks" } ] } ] }

Competitive landscape

LEAP is an agentic framework that supercharges general LLMs to achieve state-of-the-art performance in formal mathematics theorem proving, solving all problems on the 2025 Putnam Competition.

Segment

Formal Mathematics LLMs

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks

LEAP: Supercharging LLMs for Formal Mathematics with Agentic Frameworks

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline