ARXIV:2601.21115 · CODE LLMS · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Multi-task Code LLMs: Data Mix or Model Merge?

arXiv

Develop high-performance, multi-task code generation models using data mixing or model merging strategies for efficient AI deployment.

Blocked on Code›Score5.0Evidence unverified

Opportunity summary

Pain Develop high-performance, multi-task code generation models using data mixing or model merging strategies for efficient AI deployment.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Develop high-performance, multi-task code generation models using data mixing or model merging strategies for efficient AI deployment. We compare two approaches for creating small, multi-task code LLMs: data mixing versus model merging.

METHOD

Full abstract

Recent research advocates deploying smaller, specialized code LLMs in agentic frameworks alongside frontier models, sparking interest in efficient strategies for multi-task learning that balance performance, constraints, and costs. We compare two approaches for creating small, multi-task code LLMs: data mixing versus model merging. We conduct extensive experiments across two model families (Qwen Coder and DeepSeek Coder) at two scales (2B and 7B parameters), fine-tuning them for code generation and code summarization tasks. Our evaluation on HumanEval, MBPP, and CodeXGlue benchmarks reveals that model merging achieves the best overall performance at larger scale across model families, retaining 96% of specialized model performance on code generation tasks while maintaining summarization capabilities. Notably, merged models can even surpass individually fine-tuned models, with our best configuration of Qwen Coder 2.5 7B model achieving 92.7% Pass@1 on HumanEval compared to 90.9% for its task-specific fine-tuned equivalent. At a smaller scale we find instead data mixing to be a preferred strategy. We further introduce a weight analysis technique to understand how different tasks affect model parameters and their implications for merging strategies. The results suggest that careful merging and mixing strategies can effectively combine task-specific capabilities without significant performance degradation, making them ideal for resource-constrained deployment scenarios.

RESULT

ScienceToStartup currently rates this 5.0/10 on the public viability pass. Our evaluation on HumanEval, MBPP, and CodeXGlue benchmarks reveals that model merging achieves the best overall performance at larger scale across model families, retaining…

WHY NOW

Code LLMs moved forward this cycle; last verified April 2026. Public score 5.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score5.0

PainDevelop high-performance, multi-task code generation models using data mixing or model merging strategies for efficient AI deployment.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

Develop high-performance, multi-task code generation models using data mixing or model merging strategies for efficient AI deployment.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

Develop high-performance, multi-task code generation models using data mixing or model merging strategies for efficient AI deployment.

Segment

Code LLMs

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "95928ced-4d35-441c-9222-5b4b972686b4", "arxiv_id": "2601.21115", "canonical_route": "/paper/multi-task-code-llms-data-mix-or-model-merge", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "multi-task-code-llms-data-mix-or-model-merge", "endpoints": { "paper_pack": "/api/v1/paper/multi-task-code-llms-data-mix-or-model-merge/paper-pack", "build_passport": "/api/v1/paper/multi-task-code-llms-data-mix-or-model-merge/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Multi-task Code LLMs: Data Mix or Model Merge?", "normalized_query": "2601.21115", "route": "/paper/multi-task-code-llms-data-mix-or-model-merge", "paper_ref": "multi-task-code-llms-data-mix-or-model-merge", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/multi-task-code-llms-data-mix-or-model-merge#webpage", "url": "https://sciencetostartup.com/paper/multi-task-code-llms-data-mix-or-model-merge", "name": "Multi-task Code LLMs: Data Mix or Model Merge?", "description": "Develop high-performance, multi-task code generation models using data mixing or model merging strategies for efficient AI deployment.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/multi-task-code-llms-data-mix-or-model-merge#scholarlyArticle", "headline": "Multi-task Code LLMs: Data Mix or Model Merge?", "description": "Develop high-performance, multi-task code generation models using data mixing or model merging strategies for efficient AI deployment.", "url": "https://sciencetostartup.com/paper/multi-task-code-llms-data-mix-or-model-merge", "sameAs": "https://arxiv.org/abs/2601.21115", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2601.21115" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-01-28T23:06:09.000Z", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 5 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Code LLMs" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Code LLMs", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Multi-task Code LLMs: Data Mix or Model Merge?", "item": "https://sciencetostartup.com/paper/multi-task-code-llms-data-mix-or-model-merge" } ] } ] }

Competitive landscape

Develop high-performance, multi-task code generation models using data mixing or model merging strategies for efficient AI deployment.

Segment

Code LLMs

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Multi-task Code LLMs: Data Mix or Model Merge?

Multi-task Code LLMs: Data Mix or Model Merge?

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline