ARXIV:2605.13484 · LLM CALIBRATION · SUBMITTED 14 MAY · 20:10 UTC · FRESHNESS FRESH

Verified Source: PDF linked · Verified PaperPack: citation fields available · Partial Proof: unverified proof status

Discovery of Hidden Miscalibration Regimes

Katarzyna Kobalczyk · Mihaela van der Schaar · arXiv

A diagnostic framework to discover and address input-dependent miscalibration in LLMs, improving local confidence correction.

Ship in 2-4 weeks · Score 6.0 · Evidence unverified

Opportunity summary

Pain A diagnostic framework to discover and address input-dependent miscalibration in LLMs, improving local confidence correction.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified


PROBLEM

Calibration is commonly evaluated by comparing model confidence with its empirical correctness, implicitly treating reliability as a function of the confidence score alone. However, this view can hide substantial structure: models may be systematically overconfident on some kinds of inputs and underconfident…

METHOD

Full abstract

Calibration is commonly evaluated by comparing model confidence with its empirical correctness, implicitly treating reliability as a function of the confidence score alone. However, this view can hide substantial structure: models may be systematically overconfident on some kinds of inputs and underconfident on others, causing global reliability diagnostics to obscure localised calibration failures. To address this, we formulate the problem of discovering hidden miscalibration regimes without assuming access to predefined data slices. We define the corresponding miscalibration field and propose a diagnostic framework for estimating it. Our approach learns a calibration-aware representation of the input space and estimates signed local miscalibration by kernel smoothing in the learned geometry. Across four real-world LLM benchmarks and twelve LLMs, we find that input-dependent calibration heterogeneity is prevalent. We further show that the discovered fields are actionable: they support local confidence correction and reduce calibration error in systematically miscalibrated regions where confidence-based methods such as isotonic regression and temperature scaling are less effective.
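The kernel-smoothing step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes input embeddings are already available (the paper learns a calibration-aware representation, which is not reproduced here), and the Gaussian kernel, Euclidean distance, fixed bandwidth, sign convention (positive means overconfident), and the function names `signed_local_miscalibration` and `locally_corrected_confidence` are all hypothetical choices made for illustration.

```python
import numpy as np


def signed_local_miscalibration(z_query, z_ref, conf_ref, correct_ref, bandwidth=1.0):
    """Nadaraya-Watson estimate of E[confidence - correctness | z].

    z_query: (n_query, d) embeddings where the field is evaluated.
    z_ref:   (n_ref, d) embeddings of held-out examples.
    Positive values mean local overconfidence (assumed sign convention).
    """
    z_query = np.asarray(z_query, dtype=float)
    z_ref = np.asarray(z_ref, dtype=float)
    residual = np.asarray(conf_ref, dtype=float) - np.asarray(correct_ref, dtype=float)
    # Pairwise squared Euclidean distances, shape (n_query, n_ref).
    d2 = ((z_query[:, None, :] - z_ref[None, :, :]) ** 2).sum(axis=-1)
    # Gaussian kernel weights (an assumed kernel choice).
    w = np.exp(-d2 / (2.0 * bandwidth ** 2))
    # Weighted average of residuals; the floor avoids division by zero
    # when a query point is far from every reference point.
    return (w @ residual) / np.maximum(w.sum(axis=1), 1e-12)


def locally_corrected_confidence(conf_query, field_query):
    """Subtract the estimated local bias and clip to a valid probability."""
    return np.clip(np.asarray(conf_query, dtype=float) - field_query, 0.0, 1.0)


# Toy data with two hidden regimes in a 1-D embedding space (synthetic, for illustration only).
# Regime A (z = -2): confidence 0.9 but accuracy 0.6, so overconfident.
# Regime B (z = +2): confidence 0.6 but accuracy 0.9, so underconfident.
z_ref = np.concatenate([np.full((10, 1), -2.0), np.full((10, 1), 2.0)])
conf_ref = np.concatenate([np.full(10, 0.9), np.full(10, 0.6)])
correct_ref = np.array([1, 1, 1, 1, 1, 1, 0, 0, 0, 0,
                        1, 1, 1, 1, 1, 1, 1, 1, 1, 0])

# Globally, mean confidence equals mean accuracy, so the model looks calibrated.
global_gap = conf_ref.mean() - correct_ref.mean()

# Locally, the field recovers the opposite-signed biases of the two regimes.
z_query = np.array([[-2.0], [2.0]])
field = signed_local_miscalibration(z_query, z_ref, conf_ref, correct_ref, bandwidth=0.5)
corrected = locally_corrected_confidence([0.9, 0.6], field)

print("global gap:", global_gap)
print("local field:", field)
print("corrected confidence:", corrected)
```

In this toy setup the global gap is zero while the local field is roughly +0.3 in one regime and -0.3 in the other, which is the kind of cancellation the abstract says global reliability diagnostics can obscure. A confidence-only recalibration map cannot undo both biases here in the same way, because the correction depends on where the input lies, not only on the confidence value.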

RESULT

ScienceToStartup currently rates this 6.0/10 on the public viability pass. We further show that the discovered fields are actionable: they support local confidence correction and reduce calibration error in systematically miscalibrated regions where confidence-based…

WHY NOW

LLM Calibration moved forward this cycle; last verified May 2026. Public score 6.0/10. Production flags indicate code availability.


Opportunity summary

Score 6.0

Pain A diagnostic framework to discover and address input-dependent miscalibration in LLMs, improving local confidence correction.

Evidence 0 refs | 0 sources | 0% coverage

Blocker no shell-level blocker reported

Analysis summary

A diagnostic framework to discover and address input-dependent miscalibration in LLMs, improving local confidence correction.


Competitive landscape

A diagnostic framework to discover and address input-dependent miscalibration in LLMs, improving local confidence correction.

Segment

LLM Calibration

Adoption evidence

No public code link in the paper record yet

Commercial read

6.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/discovery-of-hidden-miscalibration-regimes#webpage", "url": "https://sciencetostartup.com/paper/discovery-of-hidden-miscalibration-regimes", "name": "Discovery of Hidden Miscalibration Regimes", "description": "A diagnostic framework to discover and address input-dependent miscalibration in LLMs, improving local confidence correction.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/discovery-of-hidden-miscalibration-regimes#scholarlyArticle", "headline": "Discovery of Hidden Miscalibration Regimes", "description": "A diagnostic framework to discover and address input-dependent miscalibration in LLMs, improving local confidence correction.", "url": "https://sciencetostartup.com/paper/discovery-of-hidden-miscalibration-regimes", "sameAs": "https://arxiv.org/abs/2605.13484", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.13484" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-13T13:07:50.000Z", "author": [ { "@type": "Person", "name": "Katarzyna Kobalczyk" }, { "@type": "Person", "name": "Mihaela van der Schaar" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 6 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Calibration" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Calibration", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Discovery of Hidden Miscalibration Regimes", "item": 
"https://sciencetostartup.com/paper/discovery-of-hidden-miscalibration-regimes" } ] } ] }
