ARXIV:2603.05294 · AGENTS · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks

arXiv

Develop a hierarchical planning framework, STRUCTUREDAGENT, to enhance long-horizon web task efficiency using AND/OR trees.

Blocked on Code›Score7.0Evidence unverified

Opportunity summary

Pain Develop a hierarchical planning framework, STRUCTUREDAGENT, to enhance long-horizon web task efficiency using AND/OR trees.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Develop a hierarchical planning framework, STRUCTUREDAGENT, to enhance long-horizon web task efficiency using AND/OR trees. Such agents must perceive their environment, reason across multiple time steps, and take actions that optimize long-term objectives.

METHOD

Full abstract

Recent advances in large language models (LLMs) have enabled agentic systems for sequential decision-making. Such agents must perceive their environment, reason across multiple time steps, and take actions that optimize long-term objectives. However, existing web agents struggle on complex, long-horizon tasks due to limited in-context memory for tracking history, weak planning abilities, and greedy behaviors that lead to premature termination. To address these challenges, we propose STRUCTUREDAGENT, a hierarchical planning framework with two core components: (1) an online hierarchical planner that uses dynamic AND/OR trees for efficient search and (2) a structured memory module that tracks and maintains candidate solutions to improve constraint satisfaction in information-seeking tasks. The framework also produces interpretable hierarchical plans, enabling easier debugging and facilitating human intervention when needed. Our results on WebVoyager, WebArena, and custom shopping benchmarks show that STRUCTUREDAGENT improves performance on long-horizon web-browsing tasks compared to standard LLM-based agents.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. To address these challenges, we propose STRUCTUREDAGENT, a hierarchical planning framework with two core components: (1) an online hierarchical planner that uses dynamic AND/OR…

WHY NOW

Agents moved forward this cycle; last verified April 2026. Public score 7.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainDevelop a hierarchical planning framework, STRUCTUREDAGENT, to enhance long-horizon web task efficiency using AND/OR trees.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

Develop a hierarchical planning framework, STRUCTUREDAGENT, to enhance long-horizon web task efficiency using AND/OR trees.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

Develop a hierarchical planning framework, STRUCTUREDAGENT, to enhance long-horizon web task efficiency using AND/OR trees.

Segment

Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

References(14)

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

2025Thomas Schmied, Jörg Bornschein et al.

Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks

2025Lutfi Eren Erdogan, Nicholas Lee et al.

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

2024Zehan Qi, Xiao Liu et al.

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents

2024Ke Yang, Yao Liu et al.

Agent Workflow Memory

2024Z. Z. Wang, Jiayuan Mao et al.

WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration

2024Yao Zhang, Zijian Ma et al.

Tree Search for Language Model Agents

2024Jing Yu Koh, S. McAleer et al.

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

2024Hongliang He, Wenlin Yao et al.

VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks

2024Jing Yu Koh, Robert Lo et al.

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

2023Andy Zhou, Kai Yan et al.

SteP: Stacked LLM Policies for Web Actions

2023Paloma Sodhi, S. Branavan et al.

WebArena: A Realistic Web Environment for Building Autonomous Agents

2023Shuyan Zhou, Frank F. Xu et al.

Reflexion: language agents with verbal reinforcement learning

2023Noah Shinn, Federico Cassano et al.

Planning and Acting in Partially Observable Stochastic Domains

1998L. Kaelbling, M. Littman et al.

{ "contract_version": "paper-r2", "paper_id": "997f1a47-ad79-40a2-aff6-82925cf11a07", "arxiv_id": "2603.05294", "canonical_route": "/paper/structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks", "endpoints": { "paper_pack": "/api/v1/paper/structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks/paper-pack", "build_passport": "/api/v1/paper/structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks", "normalized_query": "2603.05294", "route": "/paper/structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks", "paper_ref": "structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks#webpage", "url": "https://sciencetostartup.com/paper/structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks", "name": "STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks", "description": "Develop a hierarchical planning framework, STRUCTUREDAGENT, to enhance long-horizon web task efficiency using AND/OR trees.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks#scholarlyArticle", "headline": "STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks", "description": "Develop a hierarchical planning framework, STRUCTUREDAGENT, to enhance long-horizon web task efficiency using AND/OR trees.", "url": "https://sciencetostartup.com/paper/structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks", "sameAs": "https://arxiv.org/abs/2603.05294", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.05294" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-05T15:37:06.000Z", "citation": [ { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "a0233079180eaea0b5b43573a595864814a053b5" }, "url": "https://www.semanticscholar.org/paper/a0233079180eaea0b5b43573a595864814a053b5" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "81d0a2c6001e2b9a8770d36737ec4022436a9e4c" }, "url": "https://www.semanticscholar.org/paper/81d0a2c6001e2b9a8770d36737ec4022436a9e4c" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "bde841b0dbbf7a15ee69966a828c7fe2cf532ad9" }, "url": "https://www.semanticscholar.org/paper/bde841b0dbbf7a15ee69966a828c7fe2cf532ad9" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "2d80b8305ac9297d35d085895f8b8d3984731dce" }, "url": "https://www.semanticscholar.org/paper/2d80b8305ac9297d35d085895f8b8d3984731dce" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "c68cc84ec7808d7bbd5686a6bd1393752a9d8e8d" }, "url": "https://www.semanticscholar.org/paper/c68cc84ec7808d7bbd5686a6bd1393752a9d8e8d" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "bf27f8cc6737700934f214180f5e62e71338b347" }, "url": "https://www.semanticscholar.org/paper/bf27f8cc6737700934f214180f5e62e71338b347" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "9345e55a21959948499cee997522aa5eac7ed588" }, "url": "https://www.semanticscholar.org/paper/9345e55a21959948499cee997522aa5eac7ed588" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "19261c6ad20c6c1e5585a8afcb88196173cbc8a6" }, "url": "https://www.semanticscholar.org/paper/19261c6ad20c6c1e5585a8afcb88196173cbc8a6" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "700bd9681f1b9e9e2212e10415d27b11c7e6836b" }, "url": "https://www.semanticscholar.org/paper/700bd9681f1b9e9e2212e10415d27b11c7e6836b" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "268e28f8d5235031dcd7bfae0f857439e27e8564" }, "url": "https://www.semanticscholar.org/paper/268e28f8d5235031dcd7bfae0f857439e27e8564" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "e41482f4ee984f17382f6cdd900df094d928be06" }, "url": "https://www.semanticscholar.org/paper/e41482f4ee984f17382f6cdd900df094d928be06" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "0671fd553dd670a4e820553a974bc48040ba0819" }, "url": "https://www.semanticscholar.org/paper/0671fd553dd670a4e820553a974bc48040ba0819" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "116d7798c1123cf7fad4176e98f58fd49de4f8f1" }, "url": "https://www.semanticscholar.org/paper/116d7798c1123cf7fad4176e98f58fd49de4f8f1" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "f554b22d2ccf786a6d61d5858f43024ba9115e15" }, "url": "https://www.semanticscholar.org/paper/f554b22d2ccf786a6d61d5858f43024ba9115e15" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Agents" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Agents", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon", "item": "https://sciencetostartup.com/paper/structuredagent-planning-with-and-or-trees-for-long-horizon-web-tasks" } ] } ] }

Competitive landscape

Develop a hierarchical planning framework, STRUCTUREDAGENT, to enhance long-horizon web task efficiency using AND/OR trees.

Segment

Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

References(14)

LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities

2025Thomas Schmied, Jörg Bornschein et al.

Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks

2025Lutfi Eren Erdogan, Nicholas Lee et al.

WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning

2024Zehan Qi, Xiao Liu et al.

AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agents

2024Ke Yang, Yao Liu et al.

Agent Workflow Memory

2024Z. Z. Wang, Jiayuan Mao et al.

WebPilot: A Versatile and Autonomous Multi-Agent System for Web Task Execution with Strategic Exploration

2024Yao Zhang, Zijian Ma et al.

Tree Search for Language Model Agents

2024Jing Yu Koh, S. McAleer et al.

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

2024Hongliang He, Wenlin Yao et al.

VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks

2024Jing Yu Koh, Robert Lo et al.

Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models

2023Andy Zhou, Kai Yan et al.

SteP: Stacked LLM Policies for Web Actions

2023Paloma Sodhi, S. Branavan et al.

WebArena: A Realistic Web Environment for Building Autonomous Agents

2023Shuyan Zhou, Frank F. Xu et al.

Reflexion: language agents with verbal reinforcement learning

2023Noah Shinn, Federico Cassano et al.

Planning and Acting in Partially Observable Stochastic Domains

1998L. Kaelbling, M. Littman et al.

STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks

STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(14)

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(14)

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline