ARXIV:2604.01687 · LLM AGENTS · SUBMITTED 03 APR · 20:50 UTC · FRESHNESS STALE

Verified Source: PDF linked | Verified PaperPack: citation fields available | Partial Proof: unverified proof status

EvoSkills: Self-Evolving Agent Skills via Co-Evolutionary Verification

Hanrong Zhang · Shicheng Fan · Henry Peng Zou · Yankai Chen · Zhenting Wang · Jiayu Zhou · +7 at arXiv

A framework for LLM agents to autonomously generate complex, multi-file skills for professional tasks, improving performance and reducing manual effort.

Blocked on Code | Score 5.0 | Evidence unverified

Opportunity summary

Pain A framework for LLM agents to autonomously generate complex, multi-file skills for professional tasks, improving performance and reducing manual effort.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified

PROBLEM

A framework for LLM agents to autonomously generate complex, multi-file skills for professional tasks, improving performance and reducing manual effort. A tool is a single, self-contained function, whereas a skill is a structured bundle…
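The tool/skill distinction can be made concrete with a small data-structure sketch. This is illustrative only: the class names, the `depends_on` field, and the file names are assumptions for the example, not structures taken from the paper.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Tool:
    """A single, self-contained function the agent can invoke."""
    name: str
    run: Callable[[str], str]


@dataclass
class SkillFile:
    """One file in a skill bundle, plus the other bundle files it relies on."""
    content: str
    depends_on: List[str] = field(default_factory=list)  # hypothetical field


@dataclass
class Skill:
    """A structured bundle of interdependent files, keyed by relative path."""
    name: str
    files: Dict[str, SkillFile] = field(default_factory=dict)

    def missing_dependencies(self) -> List[str]:
        """Paths that some file depends on but that are absent from the bundle."""
        missing = {
            dep
            for skill_file in self.files.values()
            for dep in skill_file.depends_on
            if dep not in self.files
        }
        return sorted(missing)


# A tool is complete on its own.
upper = Tool(name="upper", run=str.upper)

# A skill is only usable when its files are mutually consistent.
report_skill = Skill(
    name="quarterly-report",  # illustrative name
    files={
        "SKILL.md": SkillFile(
            "How to build the report.",
            depends_on=["scripts/run.py", "templates/report.md"],
        ),
        "scripts/run.py": SkillFile("print('building report')"),
    },
)
```

The consistency check is what makes a skill harder to generate than a tool: a generator has to keep every cross-file reference valid, not just produce one working function.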

METHOD

Full abstract

Anthropic proposes the concept of skills for LLM agents to tackle multi-step professional tasks that simple tool invocations cannot address. A tool is a single, self-contained function, whereas a skill is a structured bundle of interdependent multi-file artifacts. Currently, skill generation is not only labor-intensive due to manual authoring, but also may suffer from human-machine cognitive misalignment, which can lead to degraded agent performance, as evidenced by evaluations on SkillsBench. Therefore, we aim to enable agents to autonomously generate skills. However, existing self-evolving methods designed for tools cannot be directly applied to skills due to their increased complexity. To address these issues, we propose EvoSkills, a self-evolving skills framework that enables agents to autonomously construct complex, multi-file skill packages. Specifically, EvoSkills couples a Skill Generator that iteratively refines skills with a Surrogate Verifier that co-evolves to provide informative and actionable feedback without access to ground-truth test content. On SkillsBench, EvoSkills achieves the highest pass rate among five baselines on both Claude Code and Codex, and also exhibits strong generalization capabilities to six additional LLMs.
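The generator/verifier coupling described in the abstract can be sketched as a loop. This is a minimal toy under stated assumptions, not the paper's implementation. The abstract does not give prompts, stopping criteria, or data structures, so `co_evolve`, `SurrogateVerifier`, the `toy_*` stand-ins (which replace LLM calls), and all file names are hypothetical. The one property taken from the source is that the verifier only uses checks it builds itself and never reads ground-truth tests.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

Skill = Dict[str, str]          # relative path -> file content (assumed layout)
Check = Callable[[Skill], bool]


@dataclass
class SurrogateVerifier:
    """Holds self-authored checks; it has no access to ground-truth tests."""
    checks: Dict[str, Check] = field(default_factory=dict)

    def feedback(self, skill: Skill) -> List[str]:
        """Names of the surrogate checks the current skill fails."""
        return [name for name, check in self.checks.items() if not check(skill)]


REQUIRED = ["SKILL.md", "scripts/run.py", "reference/notes.md"]  # illustrative


def toy_propose_check(skill: Skill, round_idx: int) -> Optional[Tuple[str, Check]]:
    """Stand-in for verifier evolution: one new check per round, then none."""
    if round_idx >= len(REQUIRED):
        return None
    path = REQUIRED[round_idx]
    return f"has:{path}", lambda s, p=path: p in s


def toy_generate(skill: Skill, failures: List[str]) -> Skill:
    """Stand-in for the generator: adds each file that a failed check asks for."""
    updated = dict(skill)
    for name in failures:
        path = name.split(":", 1)[1]
        updated[path] = f"# placeholder for {path}\n"
    return updated


def co_evolve(generate, propose_check, max_rounds: int = 10):
    skill: Skill = {}
    verifier = SurrogateVerifier()
    history: List[List[str]] = []
    for round_idx in range(max_rounds):
        proposal = propose_check(skill, round_idx)   # verifier co-evolves
        if proposal is not None:
            name, check = proposal
            verifier.checks[name] = check
        failures = verifier.feedback(skill)          # actionable feedback
        history.append(failures)
        if proposal is None and not failures:
            break                                    # nothing new, nothing failing
        if failures:
            skill = generate(skill, failures)        # generator refines the skill
    return skill, verifier, history


skill, verifier, history = co_evolve(toy_generate, toy_propose_check)
print(sorted(skill))
```

In a real system both stand-ins would be model calls, and the final skill would then be scored once against held-out benchmark tests that neither component saw during the loop.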

RESULT

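ScienceToStartup currently rates this 5.0/10 on the public viability pass. Per the abstract, EvoSkills achieves the highest pass rate among five baselines on SkillsBench with both Claude Code and Codex, and generalizes to six additional LLMs.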

WHY NOW

LLM Agents moved forward this cycle; last verified April 2026. Public score 5.0/10.

Opportunity summary

Score 5.0

Pain A framework for LLM agents to autonomously generate complex, multi-file skills for professional tasks, improving performance and reducing manual effort.

Evidence 0 refs | 0 sources | 33% coverage

Blocker no shell-level blocker reported

Analysis summary

A framework for LLM agents to autonomously generate complex, multi-file skills for professional tasks, improving performance and reducing manual effort.

Competitive landscape

A framework for LLM agents to autonomously generate complex, multi-file skills for professional tasks, improving performance and reducing manual effort.

Segment

LLM Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/evoskills-self-evolving-agent-skills-via-co-evolutionary-verification#webpage", "url": "https://sciencetostartup.com/paper/evoskills-self-evolving-agent-skills-via-co-evolutionary-verification", "name": "EvoSkills: Self-Evolving Agent Skills via Co-Evolutionary Verification", "description": "A framework for LLM agents to autonomously generate complex, multi-file skills for professional tasks, improving performance and reducing manual effort.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/evoskills-self-evolving-agent-skills-via-co-evolutionary-verification#scholarlyArticle", "headline": "EvoSkills: Self-Evolving Agent Skills via Co-Evolutionary Verification", "description": "A framework for LLM agents to autonomously generate complex, multi-file skills for professional tasks, improving performance and reducing manual effort.", "url": "https://sciencetostartup.com/paper/evoskills-self-evolving-agent-skills-via-co-evolutionary-verification", "sameAs": "https://arxiv.org/abs/2604.01687", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.01687" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-02T06:43:20.000Z", "author": [ { "@type": "Person", "name": "Hanrong Zhang" }, { "@type": "Person", "name": "Shicheng Fan" }, { "@type": "Person", "name": "Henry Peng Zou" }, { "@type": "Person", "name": "Yankai Chen" }, { "@type": "Person", "name": "Zhenting Wang" }, { "@type": "Person", "name": "Jiayu Zhou" }, { "@type": "Person", "name": "Chengze Li" }, { "@type": "Person", "name": "Wei-Chieh Huang" }, { "@type": "Person", "name": "Yifei Yao" }, { "@type": "Person", "name": "Kening Zheng" }, { "@type": "Person", "name": "Xue Liu" }, { "@type": "Person", "name": "Xiaoxiao Li" }, { "@type": "Person", "name": "Philip S. Yu" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 5 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Agents" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Agents", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "EvoSkills: Self-Evolving Agent Skills via Co-Evolutionary Ve", "item": "https://sciencetostartup.com/paper/evoskills-self-evolving-agent-skills-via-co-evolutionary-verification" } ] } ] }
