Reasoning Models

Proof pending

7papers

3.9viability

-50%30d

Proof pending

Proof pending. Core topic summary fields are still materializing.

State of the Field

Reasoning models are advancing the capabilities of artificial intelligence in complex problem-solving across various domains, including mathematics and science. Recent developments have focused on enhancing efficiency and accuracy by integrating techniques such as metacognitive reflection, belief engineering, and adaptive thinking. These models are designed to minimize computational redundancy while improving the fidelity of reasoning processes. For builders, this evolution is crucial as it allows for the development of more robust AI systems that can tackle intricate tasks with greater reliability and lower resource requirements. The ongoing research aims to refine these models further, ensuring they can generalize effectively across diverse applications.

Last updated May 27, 2026

Topic-linked question coverage is still building for this proof surface.

Papers

1-7 of 7

Research Paper·Apr 7, 2026

PRISM-MCTS: Learning from Reasoning Trajectories with Metacognitive Reflection

PRISM-MCTS: Learning from Reasoning Trajectories with Metacognitive Reflection Siyuan Cheng, Bozhong Tian, Yanchao Hao, Zheng Wei Published: 06 Apr 2026, Last Modified: 06 Apr 2026 ACL 2026 Findings C...

7.0 viability

Research Paper·Jan 20, 2026

Finding RELIEF: Shaping Reasoning Behavior without Reasoning Supervision via Belief Engineering

Large reasoning models (LRMs) have achieved remarkable success in complex problem-solving, yet they often suffer from computational redundancy or reasoning unfaithfulness. Current methods for shaping ...

6.0 viability

Research Paper·May 13, 2026

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Recent progress in reasoning models has substantially advanced long-horizon mathematical and scientific problem solving, with several systems now reaching gold-medal-level performance on International...

3.0 viability

Research Paper·Feb 26, 2026

Stable Adaptive Thinking via Advantage Shaping and Length-Aware Gradient Regulation

Large reasoning models (LRMs) achieve strong performance through extended reasoning traces, but they often exhibit overthinking behavior for low-complexity queries. Existing efforts to mitigate this i...

3.0 viability

Research Paper·Apr 20, 2026

Learning to Correct: Calibrated Reinforcement Learning for Multi-Attempt Chain-of-Thought

State-of-the-art reasoning models utilize long chain-of-thought (CoT) to solve increasingly complex problems using more test-time computation. In this work, we explore a long CoT setting where the mod...

3.0 viability

Research Paper·Mar 19, 2026

How Uncertainty Estimation Scales with Sampling in Reasoning Models

Uncertainty estimation is critical for deploying reasoning language models, yet remains poorly understood under extended chain-of-thought reasoning. We study parallel sampling as a fully black-box app...

3.0 viability

Research Paper·Jan 15, 2026

Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models

Hierarchical reasoning model (HRM) achieves extraordinary performance on various reasoning tasks, significantly outperforming large language model-based reasoners. To understand the strengths and pote...

2.0 viability

Reasoning Models

Proof pending

State of the Field

Papers

PRISM-MCTS: Learning from Reasoning Trajectories with Metacognitive Reflection

Finding RELIEF: Shaping Reasoning Behavior without Reasoning Supervision via Belief Engineering

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Stable Adaptive Thinking via Advantage Shaping and Length-Aware Gradient Regulation

Learning to Correct: Calibrated Reinforcement Learning for Multi-Attempt Chain-of-Thought

How Uncertainty Estimation Scales with Sampling in Reasoning Models

Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models

Filters

Topic proof surfaces

Reasoning Models

Use this topic page as a durable research-area proof surface