ARXIV:2603.24238 · AI FOR AUTONOMOUS SYSTEMS · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Decentralized End-to-End Multi-AAV Pursuit Using Predictive Spatio-Temporal Observation via Deep Reinforcement Learning

Yude Li · Zhexuan Zhou · Huizhe Li · Yanke Sun · Yenan Wu · Yichen Lai · +3 at arXiv

Develop an advanced aerial swarm pursuit system using deep reinforcement learning for autonomous navigation in cluttered environments.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain Develop an advanced aerial swarm pursuit system using deep reinforcement learning for autonomous navigation in cluttered environments.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Develop an advanced aerial swarm pursuit system using deep reinforcement learning for autonomous navigation in cluttered environments. Existing methods often rely on abstracted geometric features or privileged ground-truth states, and therefore sidestep perceptual uncertainty…

METHOD

Full abstract

Decentralized cooperative pursuit in cluttered environments is challenging for autonomous aerial swarms, especially under partial and noisy perception. Existing methods often rely on abstracted geometric features or privileged ground-truth states, and therefore sidestep perceptual uncertainty in real-world settings. We propose a decentralized end-to-end multi-agent reinforcement learning (MARL) framework that maps raw LiDAR observations directly to continuous control commands. Central to the framework is the Predictive Spatio-Temporal Observation (PSTO), an egocentric grid representation that aligns obstacle geometry with predictive adversarial intent and teammate motion in a unified, fixed-resolution projection. Built on PSTO, a single decentralized policy enables agents to navigate static obstacles, intercept dynamic targets, and maintain cooperative encirclement. Simulations demonstrate that the proposed method achieves superior capture efficiency and competitive success rates compared to state-of-the-art learning-based approaches relying on privileged obstacle information. Furthermore, the unified policy scales seamlessly across different team sizes without retraining. Finally, fully autonomous outdoor experiments validate the framework on a quadrotor swarm relying on only onboard sensing and computing.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Built on PSTO, a single decentralized policy enables agents to navigate static obstacles, intercept dynamic targets, and maintain cooperative encirclement. Code availability is flagged…

WHY NOW

AI for Autonomous Systems moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainDevelop an advanced aerial swarm pursuit system using deep reinforcement learning for autonomous navigation in cluttered environments.

Evidence0 refs | 0 sources | 17% coverage

Blockerno shell-level blocker reported

Analysis summary

Develop an advanced aerial swarm pursuit system using deep reinforcement learning for autonomous navigation in cluttered environments.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Develop an advanced aerial swarm pursuit system using deep reinforcement learning for autonomous navigation in cluttered environments.

Segment

AI for Autonomous Systems

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "14d52045-9045-45d7-8213-7003c665991c", "arxiv_id": "2603.24238", "canonical_route": "/paper/decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning", "endpoints": { "paper_pack": "/api/v1/paper/decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning/paper-pack", "build_passport": "/api/v1/paper/decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Decentralized End-to-End Multi-AAV Pursuit Using Predictive Spatio-Temporal Observation via Deep Reinforcement Learning", "normalized_query": "2603.24238", "route": "/paper/decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning", "paper_ref": "decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning#webpage", "url": "https://sciencetostartup.com/paper/decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning", "name": "Decentralized End-to-End Multi-AAV Pursuit Using Predictive Spatio-Temporal Observation via Deep Reinforcement Learning", "description": "Develop an advanced aerial swarm pursuit system using deep reinforcement learning for autonomous navigation in cluttered environments.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning#scholarlyArticle", "headline": "Decentralized End-to-End Multi-AAV Pursuit Using Predictive Spatio-Temporal Observation via Deep Reinforcement Learning", "description": "Develop an advanced aerial swarm pursuit system using deep reinforcement learning for autonomous navigation in cluttered environments.", "url": "https://sciencetostartup.com/paper/decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning", "sameAs": "https://arxiv.org/abs/2603.24238", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.24238" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-25T12:23:35.000Z", "author": [ { "@type": "Person", "name": "Yude Li", "affiliation": { "@type": "Organization", "name": "Harbin Institute of Technology, Shenzhen, Guangdong, China" } }, { "@type": "Person", "name": "Zhexuan Zhou", "affiliation": { "@type": "Organization", "name": "Harbin Institute of Technology, Shenzhen, Guangdong, China" } }, { "@type": "Person", "name": "Huizhe Li", "affiliation": { "@type": "Organization", "name": "Harbin Institute of Technology, Shenzhen, Guangdong, China" } }, { "@type": "Person", "name": "Yanke Sun", "affiliation": { "@type": "Organization", "name": "Harbin Institute of Technology, Shenzhen, Guangdong, China" } }, { "@type": "Person", "name": "Yenan Wu", "affiliation": { "@type": "Organization", "name": "Harbin Institute of Technology, Shenzhen, Guangdong, China" } }, { "@type": "Person", "name": "Yichen Lai", "affiliation": { "@type": "Organization", "name": "Harbin Institute of Technology, Shenzhen, Guangdong, China" } }, { "@type": "Person", "name": "Yiming Wang", "affiliation": { "@type": "Organization", "name": "Harbin Institute of Technology, Shenzhen, Guangdong, China" } }, { "@type": "Person", "name": "Youmin Gong", "affiliation": { "@type": "Organization", "name": "Harbin Institute of Technology, Shenzhen, Guangdong, China" } }, { "@type": "Person", "name": "Jie Mei", "affiliation": { "@type": "Organization", "name": "Harbin Institute of Technology, Shenzhen, Guangdong, China" } } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "AI for Autonomous Systems" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "AI for Autonomous Systems", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Decentralized End-to-End Multi-AAV Pursuit Using Predictive ", "item": "https://sciencetostartup.com/paper/decentralized-end-to-end-multi-aav-pursuit-using-predictive-spatio-temporal-observation-via-deep-reinforcement-learning" } ] }, { "@type": "FAQPage", "mainEntity": [ { "@type": "Question", "name": "What is the startup potential of \"Decentralized End-to-End Multi-AAV Pursuit Using Predictive \"?", "acceptedAnswer": { "@type": "Answer", "text": "Develop an advanced aerial swarm pursuit system using deep reinforcement learning for autonomous navigation in cluttered environments." } }, { "@type": "Question", "name": "What products could be built from this research?", "acceptedAnswer": { "@type": "Answer", "text": "Transform this framework into an API or software suite for companies working with autonomous aerial systems to improve their navigation algorithms, especially in unpredictable settings." } }, { "@type": "Question", "name": "What are the practical use cases?", "acceptedAnswer": { "@type": "Answer", "text": "Deploy the solution in urban environments where drone swarms can effectively navigate and manage tasks like local package delivery or search and rescue operations, especially in areas with signal interference or occlusions." } }, { "@type": "Question", "name": "What industries could this research disrupt?", "acceptedAnswer": { "@type": "Answer", "text": "It improves upon current state-to-control paradigms that depend on clean sensor data, making aerial operations more resilient to environmental noise and occlusion issues." } } ] } ] }

Competitive landscape

Develop an advanced aerial swarm pursuit system using deep reinforcement learning for autonomous navigation in cluttered environments.

Segment

AI for Autonomous Systems

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Decentralized End-to-End Multi-AAV Pursuit Using Predictive Spatio-Temporal Observation via Deep Reinforcement Learning

Decentralized End-to-End Multi-AAV Pursuit Using Predictive Spatio-Temporal Observation via Deep Reinforcement Learning

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline