ARXIV:2605.14290 · AGENTS · SUBMITTED 15 MAY · 20:12 UTC · FRESHNESS FRESH

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Web Agents Should Adopt the Plan-Then-Execute Paradigm

Julien Piet · Annabella Chow · Yiwei Hou · Muxi Lyu · Sylvie Venuto · Jinhao Zhu · +2 at arXiv

Develop typed, auditable website APIs to enable plan-then-execute web agents that are more robust to prompt injection.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain Develop typed, auditable website APIs to enable plan-then-execute web agents that are more robust to prompt injection.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Develop typed, auditable website APIs to enable plan-then-execute web agents that are more robust to prompt injection. We argue that it is the wrong default for web agents.

METHOD

ReAct has become the default architecture across LLM agents, and many existing web agents follow this paradigm. We argue that it is the wrong default for web agents.

Full abstract

ReAct has become the default architecture across LLM agents, and many existing web agents follow this paradigm. We argue that it is the wrong default for web agents. Instead, web agents should default to plan-then-execute: commit to a task-specific program before observing runtime web content, then execute it. The reason is that web content mixes inputs from many parties. An e-commerce product page may combine a seller's listing, customer reviews and sponsored advertisements. Under ReAct, all of this content flows into the model when deciding on the next action, creating a direct path for prompt injections to steer the agent's control flow. Plan-then-execute changes this boundary: untrusted data may influence values or branches inside a predefined execution graph, but it cannot redefine the user task or cause the model to synthesize new actions at runtime. We analyze WebArena, a popular web agent benchmark, and find that all tasks are compatible with plan-then-execute, while 80% can be completed with a purely programmatic plan, without any runtime LLM subroutine. We identify the main barrier to adopting plan-then-execute on the web: For it to work well, tools must map cleanly to semantic actions, with effects known before execution, so agents have enough information to plan. The web does not naturally expose that interface. Browser tools such as click, type, and scroll have page-dependent meanings. Planning at this layer is near-sighted: the agent can only see actions on the current page, and later actions appear only after it acts. Closing this gap requires typed interfaces that turn website interactions from clicks and keystrokes to task-level operations. This is an infrastructure problem, not a modeling problem. Web tasks do not need reactivity by default; they need typed, complete, auditable website APIs.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Web tasks do not need reactivity by default; they need typed, complete, auditable website APIs. Code availability is flagged in the production record; the…

WHY NOW

Agents moved forward this cycle; last verified May 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainDevelop typed, auditable website APIs to enable plan-then-execute web agents that are more robust to prompt injection.

Evidence0 refs | 0 sources | 0% coverage

Blockerno shell-level blocker reported

Analysis summary

Develop typed, auditable website APIs to enable plan-then-execute web agents that are more robust to prompt injection.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Develop typed, auditable website APIs to enable plan-then-execute web agents that are more robust to prompt injection.

Segment

Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "27582464-589d-4218-a7fc-536c274ef6ad", "arxiv_id": "2605.14290", "canonical_route": "/paper/web-agents-should-adopt-the-plan-then-execute-paradigm", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "web-agents-should-adopt-the-plan-then-execute-paradigm", "endpoints": { "paper_pack": "/api/v1/paper/web-agents-should-adopt-the-plan-then-execute-paradigm/paper-pack", "build_passport": "/api/v1/paper/web-agents-should-adopt-the-plan-then-execute-paradigm/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Web Agents Should Adopt the Plan-Then-Execute Paradigm", "normalized_query": "2605.14290", "route": "/paper/web-agents-should-adopt-the-plan-then-execute-paradigm", "paper_ref": "web-agents-should-adopt-the-plan-then-execute-paradigm", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/web-agents-should-adopt-the-plan-then-execute-paradigm#webpage", "url": "https://sciencetostartup.com/paper/web-agents-should-adopt-the-plan-then-execute-paradigm", "name": "Web Agents Should Adopt the Plan-Then-Execute Paradigm", "description": "Develop typed, auditable website APIs to enable plan-then-execute web agents that are more robust to prompt injection.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/web-agents-should-adopt-the-plan-then-execute-paradigm#scholarlyArticle", "headline": "Web Agents Should Adopt the Plan-Then-Execute Paradigm", "description": "Develop typed, auditable website APIs to enable plan-then-execute web agents that are more robust to prompt injection.", "url": "https://sciencetostartup.com/paper/web-agents-should-adopt-the-plan-then-execute-paradigm", "sameAs": "https://arxiv.org/abs/2605.14290", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.14290" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-14T02:48:57.000Z", "author": [ { "@type": "Person", "name": "Julien Piet" }, { "@type": "Person", "name": "Annabella Chow" }, { "@type": "Person", "name": "Yiwei Hou" }, { "@type": "Person", "name": "Muxi Lyu" }, { "@type": "Person", "name": "Sylvie Venuto" }, { "@type": "Person", "name": "Jinhao Zhu" }, { "@type": "Person", "name": "Raluca Ada Popa" }, { "@type": "Person", "name": "David Wagner" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Agents" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Agents", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Web Agents Should Adopt the Plan-Then-Execute Paradigm", "item": "https://sciencetostartup.com/paper/web-agents-should-adopt-the-plan-then-execute-paradigm" } ] } ] }

Competitive landscape

Develop typed, auditable website APIs to enable plan-then-execute web agents that are more robust to prompt injection.

Segment

Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Web Agents Should Adopt the Plan-Then-Execute Paradigm

Web Agents Should Adopt the Plan-Then-Execute Paradigm

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline