Recent advancements in language models are increasingly focused on enhancing accessibility and efficiency, particularly for low-resource languages. Innovations like Kakugo enable the creation of small language models for 54 languages at minimal cost, democratizing AI development for underserved communities. Meanwhile, techniques such as reward-guided stitching in diffusion models are improving reasoning capabilities by aggregating intermediate outputs, leading to significant accuracy gains in complex tasks. The introduction of specialized models like LilMoo for Hindi and Sabiá-4 for Brazilian Portuguese highlights a trend toward tailored solutions that outperform larger multilingual counterparts in specific linguistic contexts. Additionally, value-aware numerical representations are addressing fundamental weaknesses in numerical reasoning, while low-resolution visual tokens are being explored to enrich character modeling in languages like Chinese. Collectively, these efforts are reshaping the landscape of language modeling, making it more inclusive and robust for diverse applications across various languages and tasks.
Reasoning with large language models often benefits from generating multiple chains-of-thought, but existing aggregation strategies are typically trajectory-level (e.g., selecting the best trace or vo...
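A common trajectory-level baseline of the kind this abstract contrasts against is self-consistency: sample several chains-of-thought, reduce each to its final answer, and majority-vote. A minimal sketch (the function name and sample data are illustrative, not from the paper):

```python
from collections import Counter

def majority_vote(final_answers):
    """Trajectory-level aggregation: each sampled chain-of-thought is
    reduced to its final answer, and the most frequent answer wins."""
    answer, _count = Counter(final_answers).most_common(1)[0]
    return answer

# Five sampled traces whose final answers were:
print(majority_vote(["42", "42", "41", "42", "7"]))  # prints 42
```

Note that this discards everything except the final answer of each trace, which is exactly the limitation the abstract raises: intermediate reasoning steps never get aggregated.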
We present Kakugo, a novel and cost-effective pipeline designed to train general-purpose Small Language Models (SLMs) for low-resource languages using only the language name as input. By using a large...
Transformer-based language models often achieve strong results on mathematical reasoning benchmarks while remaining fragile on basic numerical understanding and arithmetic operations. A central limita...
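One way to make the "value-aware" idea concrete: embed a numeral from features of its magnitude rather than from an arbitrary vocabulary index, so numerically close values receive close embeddings. The sketch below (sinusoidal features of the log-scaled value) is an assumed illustration of the general idea, not the paper's actual representation:

```python
import math

def value_embedding(x, dim=8):
    """Hypothetical value-aware embedding: encode a number's magnitude
    via sinusoidal features of its log value, instead of treating the
    numeral as an opaque token index (a sketch, not the paper's method)."""
    v = math.log1p(abs(x))
    emb = []
    for i in range(dim // 2):
        freq = 1.0 / (10 ** i)  # multi-scale frequencies
        emb.append(math.sin(v * freq))
        emb.append(math.cos(v * freq))
    return emb
```

Under this scheme the embedding of 100 sits much closer to that of 101 than to that of 100000, a property index-based token embeddings do not guarantee.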
We introduce A.X K1, a 519B-parameter Mixture-of-Experts (MoE) language model trained from scratch. Our design leverages scaling laws to optimize training configurations and vocabulary size under fixe...
This technical report presents Sabiá-4 and Sabiazinho-4, a new generation of Portuguese language models with a focus on Brazilian Portuguese. The models were developed through a four-stage tr...
Large language models typically represent Chinese characters as discrete index-based tokens, largely ignoring their visual form. For logographic scripts, visual structure carries semantic and phonetic...
The dominance of large multilingual foundation models has widened linguistic inequalities in Natural Language Processing (NLP), often leaving low-resource languages underrepresented. This paper introd...
Multimodal large language models (MLLMs) have achieved remarkable success across a broad range of vision tasks. However, constrained by the capacity of their internal world knowledge, prior work has p...
Despite emerging research on Language Models (LMs), few approaches analyse the invertibility of LMs. That is, given an LM and a desired target output sequence of tokens, determining what input prompts...
Discrete diffusion language models (dLLMs) generate text by iteratively denoising a masked sequence. Compared with autoregressive models, this paradigm naturally supports parallel decoding, bidirectio...
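The denoising loop behind dLLMs can be caricatured in a few lines: start from an all-masked sequence and fill in a subset of positions at each step. In the sketch below the "denoiser" simply copies from a known target sequence, so it illustrates only the parallel-unmasking schedule, not a trained model:

```python
import random

MASK = "[MASK]"

def toy_denoise(target, steps=3, seed=0):
    """Sketch of dLLM-style parallel decoding: begin fully masked and
    reveal several positions per step. A real model would *predict*
    each revealed token; here we copy from `target` to isolate the
    schedule itself."""
    rng = random.Random(seed)
    seq = [MASK] * len(target)
    hidden = list(range(len(target)))
    per_step = max(1, -(-len(target) // steps))  # ceil division
    while hidden:
        for i in rng.sample(hidden, min(per_step, len(hidden))):
            seq[i] = target[i]
            hidden.remove(i)
    return seq

toy_denoise(["dLLMs", "decode", "in", "parallel"], steps=2)
```

Because each step reveals a batch of positions at once, the loop finishes in far fewer iterations than left-to-right autoregressive decoding would need, which is the parallelism the abstract refers to.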