ARXIV:2604.03131 · AI AGENT SECURITY · SUBMITTED 06 APR · 20:16 UTC · FRESHNESS UNKNOWN

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

A Systematic Security Evaluation of OpenClaw and Its Variants

Yuhang Wang · Haichang Gao · Zhenxing Niu · Zhaoxiang Liu · Wenjing Zhang · Xiang Wang · +1 at arXiv

This research systematically evaluates security vulnerabilities in AI agent frameworks, revealing critical risks in tool use and multi-step planning that require lifecycle-wide governance.

Ship in 2-4 weeks›Score4.0Evidence unverified

Opportunity summary

Pain This research systematically evaluates security vulnerabilities in AI agent frameworks, revealing critical risks in tool use and multi-step planning that require lifecycle-wide governance.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

This research systematically evaluates security vulnerabilities in AI agent frameworks, revealing critical risks in tool use and multi-step planning that require lifecycle-wide governance. In this paper, we present a systematic security assessment of six…

METHOD

Full abstract

Tool-augmented AI agents substantially extend the practical capabilities of large language models, but they also introduce security risks that cannot be identified through model-only evaluation. In this paper, we present a systematic security assessment of six representative OpenClaw-series agent frameworks, namely OpenClaw, AutoClaw, QClaw, KimiClaw, MaxClaw, and ArkClaw, under multiple backbone models. To support this study, we construct a benchmark of 205 test cases covering representative attack behaviors across the full agent execution lifecycle, enabling unified evaluation of risk exposure at both the framework and model levels. Our results show that all evaluated agents exhibit substantial security vulnerabilities, and that agentized systems are significantly riskier than their underlying models used in isolation. In particular, reconnaissance and discovery behaviors emerge as the most common weaknesses, while different frameworks expose distinct high-risk profiles, including credential leakage, lateral movement, privilege escalation, and resource development. These findings indicate that the security of modern agent systems is shaped not only by the safety properties of the backbone model, but also by the coupling among model capability, tool use, multi-step planning, and runtime orchestration. We further show that once an agent is granted execution capability and persistent runtime context, weaknesses arising in early stages can be amplified into concrete system-level failures. Overall, our study highlights the need to move beyond prompt-level safeguards toward lifecycle-wide security governance for intelligent agent frameworks.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. To support this study, we construct a benchmark of 205 test cases covering representative attack behaviors across the full agent execution lifecycle, enabling unified…

WHY NOW

AI Agent Security moved forward this cycle; last verified April 2026. Public score 4.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score4.0

PainThis research systematically evaluates security vulnerabilities in AI agent frameworks, revealing critical risks in tool use and multi-step planning that require lifecycle-wide governance.

Evidence0 refs | 0 sources | 0% coverage

Blockerno shell-level blocker reported

Analysis summary

This research systematically evaluates security vulnerabilities in AI agent frameworks, revealing critical risks in tool use and multi-step planning that require lifecycle-wide governance.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

This research systematically evaluates security vulnerabilities in AI agent frameworks, revealing critical risks in tool use and multi-step planning that require lifecycle-wide governance.

Segment

AI Agent Security

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "d5e35245-178d-4ad0-bd40-cc1499a19348", "arxiv_id": "2604.03131", "canonical_route": "/paper/a-systematic-security-evaluation-of-openclaw-and-its-variants", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "a-systematic-security-evaluation-of-openclaw-and-its-variants", "endpoints": { "paper_pack": "/api/v1/paper/a-systematic-security-evaluation-of-openclaw-and-its-variants/paper-pack", "build_passport": "/api/v1/paper/a-systematic-security-evaluation-of-openclaw-and-its-variants/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "A Systematic Security Evaluation of OpenClaw and Its Variants", "normalized_query": "2604.03131", "route": "/paper/a-systematic-security-evaluation-of-openclaw-and-its-variants", "paper_ref": "a-systematic-security-evaluation-of-openclaw-and-its-variants", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/a-systematic-security-evaluation-of-openclaw-and-its-variants#webpage", "url": "https://sciencetostartup.com/paper/a-systematic-security-evaluation-of-openclaw-and-its-variants", "name": "A Systematic Security Evaluation of OpenClaw and Its Variants", "description": "This research systematically evaluates security vulnerabilities in AI agent frameworks, revealing critical risks in tool use and multi-step planning that require lifecycle-wide governance.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/a-systematic-security-evaluation-of-openclaw-and-its-variants#scholarlyArticle", "headline": "A Systematic Security Evaluation of OpenClaw and Its Variants", "description": "This research systematically evaluates security vulnerabilities in AI agent frameworks, revealing critical risks in tool use and multi-step planning that require lifecycle-wide governance.", "url": "https://sciencetostartup.com/paper/a-systematic-security-evaluation-of-openclaw-and-its-variants", "sameAs": "https://arxiv.org/abs/2604.03131", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.03131" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-03T15:52:36.000Z", "author": [ { "@type": "Person", "name": "Yuhang Wang" }, { "@type": "Person", "name": "Haichang Gao" }, { "@type": "Person", "name": "Zhenxing Niu" }, { "@type": "Person", "name": "Zhaoxiang Liu" }, { "@type": "Person", "name": "Wenjing Zhang" }, { "@type": "Person", "name": "Xiang Wang" }, { "@type": "Person", "name": "Shiguo Lian" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 4 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "AI Agent Security" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "AI Agent Security", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "A Systematic Security Evaluation of OpenClaw and Its Variant", "item": "https://sciencetostartup.com/paper/a-systematic-security-evaluation-of-openclaw-and-its-variants" } ] } ] }

Competitive landscape

This research systematically evaluates security vulnerabilities in AI agent frameworks, revealing critical risks in tool use and multi-step planning that require lifecycle-wide governance.

Segment

AI Agent Security

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

A Systematic Security Evaluation of OpenClaw and Its Variants

A Systematic Security Evaluation of OpenClaw and Its Variants

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline