ARXIV:2605.20477 · AGENTS · SUBMITTED 21 MAY · 20:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Training Language Agents to Learn from Experience

Yuval Shalev · Zifeng Ding · Mateja Jamnik · arXiv

A framework and training pipeline for language agents to learn reusable lessons from experience, enabling cross-task self-improvement and generalization.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A framework and training pipeline for language agents to learn reusable lessons from experience, enabling cross-task self-improvement and generalization.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A framework and training pipeline for language agents to learn reusable lessons from experience, enabling cross-task self-improvement and generalization. Whether such experience can be distilled into reusable lessons that improve performance on future unseen…

METHOD

Full abstract

Language agents can adapt from experience in interactive environments, but current reflection-based methods can only self-correct within a single task instance. Whether such experience can be distilled into reusable lessons that improve performance on future unseen tasks remains unclear. We address this problem by introducing the In-context Training (ICT) task, a framework for evaluating cross-task self-improvement in language agents. In ICT, a reflector model observes trajectories collected by an actor model and generates system prompts intended to improve the actor's performance on future unseen tasks. We then propose an RL-based training pipeline for learning such reflections directly from experience, without human-provided examples. Across ALFWorld and MiniHack, our trained reflectors outperform an untrained baseline on most held-out task families, showing that the ability to learn from experience can itself be learned. In some cases, we observe generalisation beyond the benchmark on which the reflector was trained, to substantially different environments. Finally, we introduce MetaGym, a generic Python library for constructing meta-environments, enabling future research on self-improving language agents.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Whether such experience can be distilled into reusable lessons that improve performance on future unseen tasks remains unclear. Code availability is flagged in the…

WHY NOW

Agents moved forward this cycle; last verified May 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA framework and training pipeline for language agents to learn reusable lessons from experience, enabling cross-task self-improvement and generalization.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A framework and training pipeline for language agents to learn reusable lessons from experience, enabling cross-task self-improvement and generalization.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A framework and training pipeline for language agents to learn reusable lessons from experience, enabling cross-task self-improvement and generalization.

Segment

Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "ff509b0f-fd40-42e5-b3c9-0563e0571598", "arxiv_id": "2605.20477", "canonical_route": "/paper/training-language-agents-to-learn-from-experience", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "training-language-agents-to-learn-from-experience", "endpoints": { "paper_pack": "/api/v1/paper/training-language-agents-to-learn-from-experience/paper-pack", "build_passport": "/api/v1/paper/training-language-agents-to-learn-from-experience/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Training Language Agents to Learn from Experience", "normalized_query": "2605.20477", "route": "/paper/training-language-agents-to-learn-from-experience", "paper_ref": "training-language-agents-to-learn-from-experience", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/training-language-agents-to-learn-from-experience#webpage", "url": "https://sciencetostartup.com/paper/training-language-agents-to-learn-from-experience", "name": "Training Language Agents to Learn from Experience", "description": "A framework and training pipeline for language agents to learn reusable lessons from experience, enabling cross-task self-improvement and generalization.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/training-language-agents-to-learn-from-experience#scholarlyArticle", "headline": "Training Language Agents to Learn from Experience", "description": "A framework and training pipeline for language agents to learn reusable lessons from experience, enabling cross-task self-improvement and generalization.", "url": "https://sciencetostartup.com/paper/training-language-agents-to-learn-from-experience", "sameAs": "https://arxiv.org/abs/2605.20477", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.20477" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-19T20:41:12.000Z", "author": [ { "@type": "Person", "name": "Yuval Shalev" }, { "@type": "Person", "name": "Zifeng Ding" }, { "@type": "Person", "name": "Mateja Jamnik" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Agents" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Agents", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Training Language Agents to Learn from Experience", "item": "https://sciencetostartup.com/paper/training-language-agents-to-learn-from-experience" } ] } ] }

Competitive landscape

A framework and training pipeline for language agents to learn reusable lessons from experience, enabling cross-task self-improvement and generalization.

Segment

Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Training Language Agents to Learn from Experience

Training Language Agents to Learn from Experience

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline