ARXIV:2604.06811 · AGENT SECURITY · SUBMITTED 10 APR · 00:14 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems

Yunhao Feng · Yifan Ding · Yingshui Tan · Boren Zheng · Yanming Guo · Xiaolong Li · +3 at arXiv

SkillTrojan introduces a novel backdoor attack on skill-based agent systems, demonstrating high attack success with minimal performance degradation.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain SkillTrojan introduces a novel backdoor attack on skill-based agent systems, demonstrating high attack success with minimal performance degradation.

Evidence 49 refs | 3 sources | 67% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

SkillTrojan introduces a novel backdoor attack on skill-based agent systems, demonstrating high attack success with minimal performance degradation. We propose SkillTrojan, a backdoor attack that targets skill implementations rather than model parameters or training…

METHOD

Full abstract

Skill-based agent systems tackle complex tasks by composing reusable skills, improving modularity and scalability while introducing a largely unexamined security attack surface. We propose SkillTrojan, a backdoor attack that targets skill implementations rather than model parameters or training data. SkillTrojan embeds malicious logic inside otherwise plausible skills and leverages standard skill composition to reconstruct and execute an attacker-specified payload. The attack partitions an encrypted payload across multiple benign-looking skill invocations and activates only under a predefined trigger. SkillTrojan also supports automated synthesis of backdoored skills from arbitrary skill templates, enabling scalable propagation across skill-based agent ecosystems. To enable systematic evaluation, we release a dataset of 3,000+ curated backdoored skills spanning diverse skill patterns and trigger-payload configurations. We instantiate SkillTrojan in a representative code-based agent setting and evaluate both clean-task utility and attack success rate. Our results show that skill-level backdoors can be highly effective with minimal degradation of benign behavior, exposing a critical blind spot in current skill-based agent architectures and motivating defenses that explicitly reason about skill composition and execution. Concretely, on EHR SQL, SkillTrojan attains up to 97.2% ASR while maintaining 89.3% clean ACC on GPT-5.2-1211-Global.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. SkillTrojan also supports automated synthesis of backdoored skills from arbitrary skill templates, enabling scalable propagation across skill-based agent ecosystems. Code availability is flagged in…

WHY NOW

Agent Security moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainSkillTrojan introduces a novel backdoor attack on skill-based agent systems, demonstrating high attack success with minimal performance degradation.

Evidence49 refs | 3 sources | 67% coverage

Blockerno shell-level blocker reported

Analysis summary

SkillTrojan introduces a novel backdoor attack on skill-based agent systems, demonstrating high attack success with minimal performance degradation.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

SkillTrojan introduces a novel backdoor attack on skill-based agent systems, demonstrating high attack success with minimal performance degradation.

Segment

Agent Security

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "aa7f8aed-b16a-4257-a28e-951da67c0961", "arxiv_id": "2604.06811", "canonical_route": "/paper/skilltrojan-backdoor-attacks-on-skill-based-agent-systems", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "skilltrojan-backdoor-attacks-on-skill-based-agent-systems", "endpoints": { "paper_pack": "/api/v1/paper/skilltrojan-backdoor-attacks-on-skill-based-agent-systems/paper-pack", "build_passport": "/api/v1/paper/skilltrojan-backdoor-attacks-on-skill-based-agent-systems/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems", "normalized_query": "2604.06811", "route": "/paper/skilltrojan-backdoor-attacks-on-skill-based-agent-systems", "paper_ref": "skilltrojan-backdoor-attacks-on-skill-based-agent-systems", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/skilltrojan-backdoor-attacks-on-skill-based-agent-systems#webpage", "url": "https://sciencetostartup.com/paper/skilltrojan-backdoor-attacks-on-skill-based-agent-systems", "name": "SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems", "description": "SkillTrojan introduces a novel backdoor attack on skill-based agent systems, demonstrating high attack success with minimal performance degradation.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/skilltrojan-backdoor-attacks-on-skill-based-agent-systems#scholarlyArticle", "headline": "SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems", "description": "SkillTrojan introduces a novel backdoor attack on skill-based agent systems, demonstrating high attack success with minimal performance degradation.", "url": "https://sciencetostartup.com/paper/skilltrojan-backdoor-attacks-on-skill-based-agent-systems", "sameAs": "https://arxiv.org/abs/2604.06811", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.06811" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-08T08:24:48.000Z", "author": [ { "@type": "Person", "name": "Yunhao Feng" }, { "@type": "Person", "name": "Yifan Ding" }, { "@type": "Person", "name": "Yingshui Tan" }, { "@type": "Person", "name": "Boren Zheng" }, { "@type": "Person", "name": "Yanming Guo" }, { "@type": "Person", "name": "Xiaolong Li" }, { "@type": "Person", "name": "Kun Zhai" }, { "@type": "Person", "name": "Yishan Li" }, { "@type": "Person", "name": "Wenke Huang" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Agent Security" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Agent Security", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems", "item": "https://sciencetostartup.com/paper/skilltrojan-backdoor-attacks-on-skill-based-agent-systems" } ] } ] }

Competitive landscape

SkillTrojan introduces a novel backdoor attack on skill-based agent systems, demonstrating high attack success with minimal performance degradation.

Segment

Agent Security

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems

SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline