ARXIV:2605.10805 · LLM OPTIMIZATION · SUBMITTED 12 MAY · 20:16 UTC · FRESHNESS FRESH

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routing for LLM-as-a-Judge

Wenbo Zhang · Lijinghua Zhang · Liner Xiang · Hengrui Cai · arXiv

Develops a routing system to dynamically select between reasoning and non-reasoning LLM judges to optimize accuracy-cost trade-offs under distribution shift.

Blocked on Code›Score4.0Evidence unverified

Opportunity summary

Pain Develops a routing system to dynamically select between reasoning and non-reasoning LLM judges to optimize accuracy-cost trade-offs under distribution shift.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Develops a routing system to dynamically select between reasoning and non-reasoning LLM judges to optimize accuracy-cost trade-offs under distribution shift. Through controlled comparisons between reasoning and non-reasoning judges, we show that explicit reasoning substantially…

METHOD

Full abstract

Reasoning-capable large language models (LLMs) have recently been adopted as automated judges, but their benefits and costs in LLM-as-a-Judge settings remain unclear. Through controlled comparisons between reasoning and non-reasoning judges, we show that explicit reasoning substantially improves judgment accuracy on tasks requiring structured verification (e.g., math and coding), while offering limited or even negative gains on simpler evaluations and incurring significantly higher computational cost. These findings motivate that reasoning should be used selectively rather than universally, with awareness of possible distribution shift. We propose a Robust Adaptive Cost-Efficient Routing (RACER), which dynamically selects between reasoning and non-reasoning judges under a fixed budget by formulating routing as a constrained distributionally robust optimization problem. RACER explicitly accounts for distribution shift via a KL-divergence uncertainty set, admits an efficient primal--dual algorithm, and enjoys theoretical guarantees including uniqueness of the optimal policy and linear convergence. Extensive experiments show that RACER achieves superior accuracy--cost trade-offs under distribution shift.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. Through controlled comparisons between reasoning and non-reasoning judges, we show that explicit reasoning substantially improves judgment accuracy on tasks requiring structured verification (e.g., math…

WHY NOW

LLM Optimization moved forward this cycle; last verified May 2026. Public score 4.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score4.0

PainDevelops a routing system to dynamically select between reasoning and non-reasoning LLM judges to optimize accuracy-cost trade-offs under distribution shift.

Evidence0 refs | 0 sources | 0% coverage

Blockerno shell-level blocker reported

Analysis summary

Develops a routing system to dynamically select between reasoning and non-reasoning LLM judges to optimize accuracy-cost trade-offs under distribution shift.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Develops a routing system to dynamically select between reasoning and non-reasoning LLM judges to optimize accuracy-cost trade-offs under distribution shift.

Segment

LLM Optimization

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "37109e9f-81e6-4c9e-a480-71b34340da72", "arxiv_id": "2605.10805", "canonical_route": "/paper/reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge", "endpoints": { "paper_pack": "/api/v1/paper/reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge/paper-pack", "build_passport": "/api/v1/paper/reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routing for LLM-as-a-Judge", "normalized_query": "2605.10805", "route": "/paper/reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge", "paper_ref": "reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge#webpage", "url": "https://sciencetostartup.com/paper/reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge", "name": "Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routing for LLM-as-a-Judge", "description": "Develops a routing system to dynamically select between reasoning and non-reasoning LLM judges to optimize accuracy-cost trade-offs under distribution shift.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge#scholarlyArticle", "headline": "Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routing for LLM-as-a-Judge", "description": "Develops a routing system to dynamically select between reasoning and non-reasoning LLM judges to optimize accuracy-cost trade-offs under distribution shift.", "url": "https://sciencetostartup.com/paper/reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge", "sameAs": "https://arxiv.org/abs/2605.10805", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.10805" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-11T16:30:20.000Z", "author": [ { "@type": "Person", "name": "Wenbo Zhang" }, { "@type": "Person", "name": "Lijinghua Zhang" }, { "@type": "Person", "name": "Liner Xiang" }, { "@type": "Person", "name": "Hengrui Cai" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 4 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Optimization" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Optimization", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routin", "item": "https://sciencetostartup.com/paper/reasoning-is-not-free-robust-adaptive-cost-efficient-routing-for-llm-as-a-judge" } ] } ] }

Competitive landscape

Develops a routing system to dynamically select between reasoning and non-reasoning LLM judges to optimize accuracy-cost trade-offs under distribution shift.

Segment

LLM Optimization

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routing for LLM-as-a-Judge

Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routing for LLM-as-a-Judge

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline