Current research in artificial intelligence, particularly on large language models (LLMs) and Transformers, increasingly focuses on improving generalization and on understanding the mechanisms underlying these architectures. Recent investigations show that while LLMs excel at specific tasks, they struggle with out-of-distribution generalization, particularly in recognizing periodic patterns, a limitation that hampers applications requiring robust adaptability, such as automated customer service or content generation. Meanwhile, hybrid architectures that combine Transformers with state space models aim to improve efficiency on in-context retrieval tasks, addressing the computational bottlenecks of standard attention. Work on task-oriented communication in vision-language models raises important questions about transparency and interpretability, which are crucial for deploying AI in sensitive environments. As researchers examine latent reasoning and intrinsic motivation, the field is shifting toward more nuanced models that better balance performance with ethical considerations, paving the way for more reliable and accountable AI systems in commercial applications.
Large language models (LLMs) based on the Transformer have demonstrated strong performance across diverse tasks. However, current models still exhibit substantial limitations in out-of-distribution (O...
Do Large Language Models (LLMs) possess a Theory of Mind (ToM)? Research into this question has focused on evaluating LLMs against benchmarks and found success across a range of social tasks. However,...
The normalization of query and key vectors is an essential part of the Transformer architecture. It ensures that learning is stable regardless of the scale of these vectors. Some normalization approac...
Intrinsic Motivation (IM) is a paradigm for generating intelligent behavior without external utilities. The existing information-theoretic methods for IM are predominantly based on information transmi...
Learning from human feedback typically relies on preference optimization that constrains policy updates through token-level regularization. However, preference optimization for language models is part...
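A standard baseline in this space is direct preference optimization (DPO), whose sequence-level loss constrains the policy relative to a frozen reference model. The sketch below is a minimal illustration of that loss, not the method proposed in the abstract; all inputs are hypothetical summed log-probabilities.

```python
import numpy as np

def dpo_loss(logp_w, logp_l, ref_logp_w, ref_logp_l, beta=0.1):
    """DPO-style preference loss: push the policy's log-prob margin between
    the chosen (w) and rejected (l) responses above the reference model's
    margin. beta controls how tightly the policy is tethered to the reference."""
    margin = beta * ((logp_w - ref_logp_w) - (logp_l - ref_logp_l))
    return -np.log(1.0 / (1.0 + np.exp(-margin)))  # -log sigmoid(margin)

# Policy prefers the chosen response more than the reference does,
# so the loss drops below the indifference value -log(0.5) ≈ 0.693.
loss = dpo_loss(logp_w=-10.0, logp_l=-14.0, ref_logp_w=-12.0, ref_logp_l=-12.0)
```

The regularization here acts at the sequence level; the abstract's point is that token-level variants of such constraints bring their own trade-offs.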
We study two recurring phenomena in Transformer language models: massive activations, in which a small number of tokens exhibit extreme outliers in a few channels, and attention sinks, in which certai...
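To make the first phenomenon concrete, here is a toy detector for massive activations: scan a hidden-state matrix for entries whose magnitude dwarfs the typical activation. The ratio threshold and the planted outlier are illustrative assumptions, not values from the paper.

```python
import numpy as np

def find_massive_activations(hidden, ratio=50.0):
    """Flag (token, channel) entries whose magnitude exceeds `ratio` times
    the median absolute activation. `hidden` has shape (tokens, channels)."""
    abs_h = np.abs(hidden)
    typical = np.median(abs_h)
    rows, cols = np.where(abs_h > ratio * typical)
    return list(zip(rows.tolist(), cols.tolist()))

# Simulated activations: Gaussian noise plus one extreme outlier on the
# first token (sink-like positions are where such outliers tend to appear).
rng = np.random.default_rng(1)
hidden = rng.normal(size=(16, 64))
hidden[0, 3] = 500.0
outliers = find_massive_activations(hidden)
```

In real models the same kind of scan is applied per layer, and the flagged tokens often coincide with the positions that attract attention-sink mass.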
We investigate whether \emph{LLM-based agents} can develop task-oriented communication protocols that differ from standard natural language in collaborative reasoning tasks. Our focus is on two core p...
Latent reasoning has recently been proposed as a paradigm that performs multi-step reasoning by generating steps in the latent space instead of the textual space. This paradigm enables r...
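The core loop of such approaches can be sketched as feeding the model's hidden state back as the next input, with no intermediate decoding to text. The toy below replaces the Transformer with a stand-in nonlinear map, so it illustrates only the control flow, not any particular published method.

```python
import numpy as np

rng = np.random.default_rng(2)
d = 8
W = rng.normal(size=(d, d)) / np.sqrt(d)  # stand-in for one Transformer step

def latent_reason(x0, steps):
    """Multi-step reasoning entirely in latent space: each 'thought' is the
    previous hidden state fed back as the next input, never decoded to tokens."""
    h = x0
    thoughts = []
    for _ in range(steps):
        h = np.tanh(W @ h)   # one latent reasoning step
        thoughts.append(h)
    return thoughts

x0 = rng.normal(size=d)
thoughts = latent_reason(x0, steps=4)
```

Compared with chain-of-thought in text, each step here costs one forward pass and produces no tokens, which is the source of the paradigm's efficiency claims.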
Transformers excel at in-context retrieval but suffer from quadratic complexity with sequence length, while State Space Models (SSMs) offer efficient linear-time processing but have limited retrieval ...
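The linear-time property of SSMs comes from carrying a fixed-size state through a recurrence rather than attending to all past tokens. A minimal diagonal SSM scan, with illustrative parameters, makes the cost structure explicit:

```python
import numpy as np

def ssm_scan(A, B, C, u):
    """Diagonal linear SSM: h_t = A * h_{t-1} + B * u_t,  y_t = C · h_t.

    State size n is fixed, so the cost is O(T * n) for T tokens,
    versus O(T^2) for full attention over the same sequence.
    """
    h = np.zeros(A.shape[0])
    ys = []
    for u_t in u:            # one O(n) update per token
        h = A * h + B * u_t
        ys.append(C @ h)
    return np.array(ys)

A = np.full(4, 0.9)          # per-dimension decay (stable: |A| < 1)
B = np.ones(4)
C = np.ones(4) / 4
u = np.array([1.0, 0.0, 0.0, 0.0])  # an impulse input
y = ssm_scan(A, B, C, u)     # geometric decay of the impulse response
```

The decaying state is also the source of the retrieval limitation the abstract mentions: information about early tokens is compressed into n numbers rather than kept verbatim, which is what hybrid Transformer-SSM designs try to compensate for.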