ARXIV:2603.28135 · REASONING AGENTS · SUBMITTED 31 MAR · 20:19 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning

Siyuan Ma · Bo Gao · Zikai Xiao · Hailong Wang · Xinlei Yu · Rui Qian · +3 at arXiv

A training-free metacognitive framework that intelligently controls and optimizes test-time reasoning for improved accuracy and compute efficiency across diverse benchmarks.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A training-free metacognitive framework that intelligently controls and optimizes test-time reasoning for improved accuracy and compute efficiency across diverse benchmarks.

Evidence 24 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A training-free metacognitive framework that intelligently controls and optimizes test-time reasoning for improved accuracy and compute efficiency across diverse benchmarks. We introduce CoT2-Meta, a training-free metacognitive reasoning framework that combines object-level chain-of-thought generation with…

METHOD

Full abstract

Recent test-time reasoning methods improve performance by generating more candidate chains or searching over larger reasoning trees, but they typically lack explicit control over when to expand, what to prune, how to repair, and when to abstain. We introduce CoT2-Meta, a training-free metacognitive reasoning framework that combines object-level chain-of-thought generation with meta-level control over partial reasoning trajectories. The framework integrates four components: strategy-conditioned thought generation, tree-structured search, an online process oracle for step-level reasoning evaluation, and a meta-controller that allocates computation through expansion, pruning, repair, stopping, and fallback decisions. Under matched inference budgets, CoT2-Meta consistently outperforms strong single-path, sampling-based, and search-based baselines, including ReST-MCTS. On the default backbone, it achieves 92.8 EM on MATH, 90.4 accuracy on GPQA, 98.65 EM on GSM8K, 75.8 accuracy on BBEH, 85.6 accuracy on MMMU-Pro, and 48.8 accuracy on HLE, with gains over the strongest non-CoT2-Meta baseline of +3.6, +5.2, +1.15, +2.0, +4.3, and +4.3 points, respectively. Beyond these core results, the framework remains effective across a broader 15-benchmark suite spanning knowledge and QA, multi-hop reasoning, coding, and out-of-distribution evaluation. Additional analyses show better compute scaling, improved calibration, stronger selective prediction, targeted repair behavior, and consistent gains across backbone families. These results suggest that explicit metacognitive control is a practical design principle for reliable and compute-efficient test-time reasoning systems.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Recent test-time reasoning methods improve performance by generating more candidate chains or searching over larger reasoning trees, but they typically lack explicit control over…

WHY NOW

Reasoning Agents moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA training-free metacognitive framework that intelligently controls and optimizes test-time reasoning for improved accuracy and compute efficiency across diverse benchmarks.

Evidence24 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A training-free metacognitive framework that intelligently controls and optimizes test-time reasoning for improved accuracy and compute efficiency across diverse benchmarks.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A training-free metacognitive framework that intelligently controls and optimizes test-time reasoning for improved accuracy and compute efficiency across diverse benchmarks.

Segment

Reasoning Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "cde1281b-add1-4433-ad84-6ac3e02cd71c", "arxiv_id": "2603.28135", "canonical_route": "/paper/cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning", "endpoints": { "paper_pack": "/api/v1/paper/cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning/paper-pack", "build_passport": "/api/v1/paper/cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning", "normalized_query": "2603.28135", "route": "/paper/cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning", "paper_ref": "cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning#webpage", "url": "https://sciencetostartup.com/paper/cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning", "name": "CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning", "description": "A training-free metacognitive framework that intelligently controls and optimizes test-time reasoning for improved accuracy and compute efficiency across diverse benchmarks.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning#scholarlyArticle", "headline": "CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning", "description": "A training-free metacognitive framework that intelligently controls and optimizes test-time reasoning for improved accuracy and compute efficiency across diverse benchmarks.", "url": "https://sciencetostartup.com/paper/cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning", "sameAs": "https://arxiv.org/abs/2603.28135", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.28135" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-30T07:59:47.000Z", "author": [ { "@type": "Person", "name": "Siyuan Ma" }, { "@type": "Person", "name": "Bo Gao" }, { "@type": "Person", "name": "Zikai Xiao" }, { "@type": "Person", "name": "Hailong Wang" }, { "@type": "Person", "name": "Xinlei Yu" }, { "@type": "Person", "name": "Rui Qian" }, { "@type": "Person", "name": "Jiayu Qian" }, { "@type": "Person", "name": "Luqi Gong" }, { "@type": "Person", "name": "Yang Liu" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Reasoning Agents" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Reasoning Agents", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reas", "item": "https://sciencetostartup.com/paper/cot2-meta-budgeted-metacognitive-control-for-test-time-reasoning" } ] } ] }

Competitive landscape

A training-free metacognitive framework that intelligently controls and optimizes test-time reasoning for improved accuracy and compute efficiency across diverse benchmarks.

Segment

Reasoning Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning

CoT2-Meta: Budgeted Metacognitive Control for Test-Time Reasoning

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline