ARXIV:2604.01601 · LLM TRAINING · SUBMITTED 03 APR · 20:50 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Training In-Context and In-Weights Mixtures Via Contrastive Context Sampling

Deeptanshu Malu · Deevyanshu Malu · Aditya Nemiwal · Sunita Sarawagi · arXiv

A novel training strategy for LLMs that balances in-context and in-weights learning by using contrastive context sampling to improve performance and prevent label copying.

Blocked on Code›Score3.0Evidence unverified

Opportunity summary

Pain A novel training strategy for LLMs that balances in-context and in-weights learning by using contrastive context sampling to improve performance and prevent label copying.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A novel training strategy for LLMs that balances in-context and in-weights learning by using contrastive context sampling to improve performance and prevent label copying. Although current LLMs exhibit both modes, standard task-specific fine-tuning often…

METHOD

Full abstract

We investigate training strategies that co-develop in-context learning (ICL) and in-weights learning (IWL), and the ability to switch between them based on context relevance. Although current LLMs exhibit both modes, standard task-specific fine-tuning often erodes ICL, motivating IC-Train - fine-tuning with in-context examples. Prior work has shown that emergence of ICL after IC-Train depends on factors such as task diversity and training duration. In this paper we show that the similarity structure between target inputs and context examples also plays an important role. Random context leads to loss of ICL and IWL dominance, while only similar examples in context causes ICL to degenerate to copying labels without regard to relevance. To address this, we propose a simple Contrastive-Context which enforces two types of contrasts: (1) mix of similar and random examples within a context to evolve a correct form of ICL, and (2) varying grades of similarity across contexts to evolve ICL-IWL mixtures. We present insights on the importance of such contrast with theoretical analysis of a minimal model. We validate with extensive empirical evaluation on four LLMs and several tasks. Diagnostic probes confirm that contrasted contexts yield stable ICL-IWL mixtures, avoiding collapse into pure ICL, IWL, or copying.

RESULT

ScienceToStartup currently rates this 3.0/10 on the public viability pass. In this paper we show that the similarity structure between target inputs and context examples also plays an important role.

WHY NOW

LLM Training moved forward this cycle; last verified April 2026. Public score 3.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score3.0

PainA novel training strategy for LLMs that balances in-context and in-weights learning by using contrastive context sampling to improve performance and prevent label copying.

Evidence0 refs | 0 sources | 33% coverage

Blockerno shell-level blocker reported

Analysis summary

A novel training strategy for LLMs that balances in-context and in-weights learning by using contrastive context sampling to improve performance and prevent label copying.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A novel training strategy for LLMs that balances in-context and in-weights learning by using contrastive context sampling to improve performance and prevent label copying.

Segment

LLM Training

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "fb871846-4d65-4089-b019-4a45fe0be63f", "arxiv_id": "2604.01601", "canonical_route": "/paper/training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling", "endpoints": { "paper_pack": "/api/v1/paper/training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling/paper-pack", "build_passport": "/api/v1/paper/training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Training In-Context and In-Weights Mixtures Via Contrastive Context Sampling", "normalized_query": "2604.01601", "route": "/paper/training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling", "paper_ref": "training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling#webpage", "url": "https://sciencetostartup.com/paper/training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling", "name": "Training In-Context and In-Weights Mixtures Via Contrastive Context Sampling", "description": "A novel training strategy for LLMs that balances in-context and in-weights learning by using contrastive context sampling to improve performance and prevent label copying.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling#scholarlyArticle", "headline": "Training In-Context and In-Weights Mixtures Via Contrastive Context Sampling", "description": "A novel training strategy for LLMs that balances in-context and in-weights learning by using contrastive context sampling to improve performance and prevent label copying.", "url": "https://sciencetostartup.com/paper/training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling", "sameAs": "https://arxiv.org/abs/2604.01601", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.01601" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-02T04:21:42.000Z", "author": [ { "@type": "Person", "name": "Deeptanshu Malu" }, { "@type": "Person", "name": "Deevyanshu Malu" }, { "@type": "Person", "name": "Aditya Nemiwal" }, { "@type": "Person", "name": "Sunita Sarawagi" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 3 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Training" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Training", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Training In-Context and In-Weights Mixtures Via Contrastive ", "item": "https://sciencetostartup.com/paper/training-in-context-and-in-weights-mixtures-via-contrastive-context-sampling" } ] } ] }

Competitive landscape

A novel training strategy for LLMs that balances in-context and in-weights learning by using contrastive context sampling to improve performance and prevent label copying.

Segment

LLM Training

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Training In-Context and In-Weights Mixtures Via Contrastive Context Sampling

Training In-Context and In-Weights Mixtures Via Contrastive Context Sampling

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline