Recent advancements in model merging are reshaping how specialized neural networks, particularly large language models, are integrated without retraining. Researchers are focusing on techniques that enhance the stability and performance of merged models while addressing issues like representational incompatibility and functional interference. New frameworks, such as those that employ directional consistency and sparsity-aware evolutionary strategies, aim to optimize the merging process by preserving task knowledge while minimizing performance degradation. Additionally, predictive methods that select merge operators based on similarity signals are streamlining the process, making it more efficient and scalable. As organizations increasingly seek to leverage multiple fine-tuned models for specific tasks, these innovations promise to reduce computational costs and improve the reliability of merged outputs, addressing commercial needs in areas like multi-task learning and domain adaptation. The field is moving toward more robust, automated solutions that can handle the complexities of diverse model interactions.
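As background for the summaries below, weight-space merging in its simplest form can be sketched as "task arithmetic": each fine-tuned model contributes a task vector (its weights minus the shared base weights), and the merged model adds a scaled sum of those vectors back onto the base. This is a minimal illustrative sketch, not any specific paper's method; the function names and the single scaling factor `alpha` are assumptions.

```python
def merge_task_vectors(base, finetuned_models, alpha=0.5):
    """Illustrative task-arithmetic merge (all names hypothetical).

    base and each entry of finetuned_models map a parameter name to a
    flat list of floats; real implementations operate on tensors.
    """
    merged = {}
    for name, base_w in base.items():
        # Task vector for each model: its delta from the shared base.
        deltas = [
            [fw - bw for fw, bw in zip(m[name], base_w)]
            for m in finetuned_models
        ]
        # Add the scaled sum of task vectors back onto the base weights.
        merged[name] = [
            bw + alpha * sum(d[i] for d in deltas)
            for i, bw in enumerate(base_w)
        ]
    return merged

base = {"layer0": [1.0, 2.0]}
model_a = {"layer0": [1.5, 2.0]}  # specialized on task A
model_b = {"layer0": [1.0, 3.0]}  # specialized on task B
print(merge_task_vectors(base, [model_a, model_b], alpha=0.5))
# {'layer0': [1.25, 2.5]}
```

The methods surveyed below refine this basic recipe, e.g. by choosing which deltas to keep, resolving sign conflicts between them, or tuning the scaling per task.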
Learning across domains is challenging when data cannot be centralized due to privacy or heterogeneity, which limits the ability to train a single comprehensive model. Model merging provides an appeal...
Model merging aims to integrate multiple task-adapted models into a unified model that preserves the knowledge of each task. In this paper, we identify that the key to this knowledge retention lies in...
Model merging enables multiple large language models (LLMs) to be combined into a single model while preserving performance. This makes it a valuable tool in LLM development, offering a competitive al...
Model merging has emerged as a promising paradigm for composing the capabilities of large language models by directly operating in weight space, enabling the integration of specialized models without ...
We propose a sparsity-aware evolutionary (SAE) framework for model merging that uses iterative pruning-merging cycles as a novel mutation operator. We incorporate the sparsity constraints i...
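The abstract above describes iterative pruning-merging cycles acting as a mutation operator. Since the full method is not shown here, the following is only a hedged sketch of what one such cycle could look like; the magnitude-pruning criterion, the averaging rule, and all names are assumptions for illustration, not the paper's implementation.

```python
def prune(weights, sparsity):
    """Magnitude pruning: zero out the smallest-magnitude fraction of weights."""
    k = int(len(weights) * sparsity)
    threshold = sorted(abs(w) for w in weights)[k] if k > 0 else 0.0
    return [0.0 if abs(w) < threshold else w for w in weights]

def prune_merge_mutation(parents, sparsity=0.5):
    """One hypothetical mutation step: prune each parent model's weights,
    then merge by averaging only the weights that survived pruning."""
    pruned = [prune(p, sparsity) for p in parents]
    merged = []
    for ws in zip(*pruned):
        nonzero = [w for w in ws if w != 0.0]
        merged.append(sum(nonzero) / len(nonzero) if nonzero else 0.0)
    return merged

child = prune_merge_mutation(
    [[0.1, 1.0, -2.0, 0.05], [0.2, -1.0, 0.0, 3.0]], sparsity=0.5
)
print(child)  # [0.0, 0.0, -2.0, 3.0]
```

In an evolutionary loop, such a child would then be evaluated on the target tasks and selected or discarded like any other offspring.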
Model merging integrates multiple task-specific models into a single consolidated one. Recent research has made progress in improving merging performance for in-distribution or multi-task scenarios, b...
Model merging has emerged as a transformative paradigm for combining the capabilities of multiple neural networks into a single unified model without additional training. With the rapid proliferation ...
Model merging unifies independently fine-tuned LLMs from the same base, enabling reuse and integration of parallel development efforts without retraining. However, in practice we observe that merging ...