ARXIV:2605.15030 · WEB AGENTS · SUBMITTED 15 MAY · 20:11 UTC · FRESHNESS FRESH

Source: PDF linked (Verified) · PaperPack: citation fields available (Verified) · Proof: unverified proof status (Partial)

WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections

Tri Cao · Yulin Chen · Hieu Cao · Yibo Li · Khoi Le · Thong Nguyen · +5 at arXiv

WARD provides robust and efficient defense for web agents against prompt injection attacks by using adversarial training and a large-scale dataset.

Ship in 2-4 weeks · Score 7.0 · Evidence unverified

Opportunity summary

Pain WARD provides robust and efficient defense for web agents against prompt injection attacks by using adversarial training and a large-scale dataset.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified


PROBLEM

WARD provides robust and efficient defense for web agents against prompt injection attacks by using adversarial training and a large-scale dataset. Existing guard models still suffer from limited generalization to unseen domains and attack…

METHOD

Full abstract

Web agents can autonomously complete online tasks by interacting with websites, but their exposure to open web environments makes them vulnerable to prompt injection attacks embedded in HTML content or visual interfaces. Existing guard models still suffer from limited generalization to unseen domains and attack patterns, high false positive rates on benign content, reduced deployment efficiency due to added latency at each step, and vulnerability to adversarial attacks that evolve over time or directly target the guard itself. To address these limitations, we propose WARD (Web Agent Robust Defense against Prompt Injection), a practical guard model for secure and efficient web agents. WARD is built on WARD-Base, a large-scale dataset with around 177K samples collected from 719 high-traffic URLs and platforms, and WARD-PIG, a dedicated dataset designed for prompt injection attacks targeting the guard model. We further introduce A3T, an adaptive adversarial attack training framework that iteratively strengthens WARD through a memory-based attacker and guard co-evolution process. Extensive experiments show that WARD achieves nearly perfect recall on out-of-distribution benchmarks, maintains low false positive rates to preserve agent utility, remains robust against guard-targeted and adaptive attacks under substantial distribution shifts, and runs efficiently in parallel with the agent without introducing additional latency.
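
The abstract says the guard runs in parallel with the agent rather than as an extra sequential step. A minimal sketch of what such a pattern could look like follows, assuming two hypothetical coroutines, `guard_check` and `plan_action`, that stand in for the guard model and the agent model. Neither name comes from the paper or its repository, and the keyword test is a toy placeholder, not WARD's detection logic.

```python
import asyncio

# Hypothetical stand-ins: neither function comes from the WARD paper or repository.
async def guard_check(observation: str) -> bool:
    """Toy guard: flags observations that contain a suspicious phrase."""
    await asyncio.sleep(0.01)  # simulates guard-model inference time
    return "ignore previous instructions" in observation.lower()

async def plan_action(observation: str) -> str:
    """Toy agent: plans the next action from the observation."""
    await asyncio.sleep(0.02)  # simulates agent-model inference time
    return "click(#submit)"

async def guarded_step(observation: str) -> str:
    # Run both coroutines concurrently so the guard overlaps with planning.
    flagged, action = await asyncio.gather(
        guard_check(observation), plan_action(observation)
    )
    # Drop the planned action if the guard flagged the observation.
    return "abort: possible prompt injection" if flagged else action

def run_step(observation: str) -> str:
    """Synchronous wrapper for a single guarded agent step."""
    return asyncio.run(guarded_step(observation))
```

In this sketch the step time is the slower of the two calls, so the guard adds no latency only as long as it finishes no later than the agent's own planning.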

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Extensive experiments show that WARD achieves nearly perfect recall on out-of-distribution benchmarks, maintains low false positive rates to preserve agent utility, remains robust against…
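
The result is stated in terms of recall and false positive rate. As a reminder of how those two numbers are computed for a binary guard, here is a small sketch. The function name `recall_and_fpr` and the toy labels are illustrative assumptions, not data or code from the paper.

```python
from typing import Sequence, Tuple

def recall_and_fpr(labels: Sequence[int], preds: Sequence[int]) -> Tuple[float, float]:
    """Return (recall, false positive rate); 1 = injection, 0 = benign."""
    tp = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    fn = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 0)
    fp = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 1)
    tn = sum(1 for y, p in zip(labels, preds) if y == 0 and p == 0)
    # Recall: share of real injections the guard catches.
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # False positive rate: share of benign pages the guard wrongly blocks.
    fpr = fp / (fp + tn) if (fp + tn) else 0.0
    return recall, fpr

# Made-up toy data: 3 injected pages, 5 benign pages, one benign page misflagged.
labels = [1, 1, 1, 0, 0, 0, 0, 0]
preds = [1, 1, 1, 0, 0, 0, 1, 0]
```

High recall means few injections slip through, while a low false positive rate means the agent is rarely stopped on benign pages, which is the utility trade-off the abstract refers to.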

WHY NOW

Web Agents moved forward this cycle; last verified May 2026. Public score 7.0/10. Implementation evidence is present through a linked repository.




Competitive landscape

WARD provides robust and efficient defense for web agents against prompt injection attacks by using adversarial training and a large-scale dataset.

Segment: Web Agents

Adoption evidence: Public code linked for build inspection

Commercial read: 7.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified


Authors: Tri Cao, Yulin Chen, Hieu Cao, Yibo Li, Khoi Le, Thong Nguyen, Yuexin Li, Yufei He, Yue Liu, Shuicheng Yan, Bryan Hooi

arXiv: https://arxiv.org/abs/2605.15030

Date published: 2026-05-14T16:26:27.000Z

Code repository: https://github.com/caothientri2001vn/WARD-WebAgent

Page: https://sciencetostartup.com/paper/ward-adversarially-robust-defense-of-web-agents-against-prompt-injections

Research domain: Web Agents

Viability score: 7

Commercial readiness: code, repo url

Accessible for free: yes

