ARXIV:2603.28116 · AUTONOMOUS DRIVING AI · SUBMITTED 31 MAR · 20:30 UTC · FRESHNESS STALE

Verified Source: PDF linked | Verified PaperPack: citation fields available | Partial Proof: unverified proof status

$AutoDrive\text{-}P^3$: Unified Chain of Perception-Prediction-Planning Thought via Reinforcement Fine-Tuning

Yuqi Ye · Zijian Zhang · Junhong Lin · Shangkun Sun · Changhao Peng · Wei Gao · arXiv

A unified framework for autonomous driving that integrates perception, prediction, and planning through chain-of-thought reasoning and reinforcement learning, achieving state-of-the-art performance.

Ship in 2-4 weeks | Score 8.0 | Evidence unverified

Opportunity summary

Pain A unified framework for autonomous driving that integrates perception, prediction, and planning through chain-of-thought reasoning and reinforcement learning, achieving state-of-the-art performance.

Evidence 29 refs | 4 sources | 83% coverage

Blocker Evidence unverified

PROBLEM

Current VLM-based approaches suffer from two major limitations: 1) Some VLMs directly…

METHOD

Full abstract

Vision-language models (VLMs) are increasingly being adopted for end-to-end autonomous driving systems due to their exceptional performance in handling long-tail scenarios. However, current VLM-based approaches suffer from two major limitations: 1) Some VLMs directly output planning results without chain-of-thought (CoT) reasoning, bypassing crucial perception and prediction stages, which creates a significant domain gap and compromises decision-making capability; 2) Other VLMs can generate outputs for perception, prediction, and planning tasks but employ a fragmented decision-making approach where these modules operate separately, leading to a significant lack of synergy that undermines true planning performance. To address these limitations, we propose ${AutoDrive\text{-}P^3}$, a novel framework that seamlessly integrates $\textbf{P}$erception, $\textbf{P}$rediction, and $\textbf{P}$lanning through structured reasoning. We introduce the ${P^3\text{-}CoT}$ dataset to facilitate coherent reasoning and propose ${P^3\text{-}GRPO}$, a hierarchical reinforcement learning algorithm that provides progressive supervision across all three tasks. Specifically, ${AutoDrive\text{-}P^3}$ progressively generates CoT reasoning and answers for perception, prediction, and planning, where perception provides essential information for subsequent prediction and planning, while both perception and prediction collectively contribute to the final planning decisions, enabling safer and more interpretable autonomous driving. Additionally, to balance inference efficiency with performance, we introduce dual thinking modes: detailed thinking and fast thinking. Extensive experiments on both open-loop (nuScenes) and closed-loop (NAVSIMv1/v2) benchmarks demonstrate that our approach achieves state-of-the-art performance in planning tasks. Code is available at https://github.com/haha-yuki-haha/AutoDrive-P3.
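The abstract describes P^3-GRPO only at a high level, so the following is a minimal sketch of how per-stage rewards might feed a GRPO-style update, not the authors' implementation. The group-relative normalization (reward minus group mean, divided by group standard deviation) is the standard GRPO advantage. Everything else is an assumption made for illustration: the stage weights, the weighted-sum combination, the function names, and the example scores.

```python
from statistics import mean, pstdev

# Hypothetical stage weights. The abstract does not state how the
# perception, prediction, and planning rewards are combined.
STAGE_WEIGHTS = {"perception": 0.2, "prediction": 0.3, "planning": 0.5}


def combined_reward(stage_rewards, weights=STAGE_WEIGHTS):
    """Weighted sum of per-stage rewards (illustrative, not the paper's formula)."""
    return sum(weights[stage] * stage_rewards[stage] for stage in weights)


def group_relative_advantages(rewards, eps=1e-6):
    """GRPO-style advantage: (reward - group mean) / (group std + eps)."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# Four sampled responses to the same driving scene, each scored per stage
# in [0, 1]. The scores are made up for the example.
group = [
    {"perception": 0.9, "prediction": 0.8, "planning": 0.7},
    {"perception": 0.9, "prediction": 0.4, "planning": 0.3},
    {"perception": 0.5, "prediction": 0.5, "planning": 0.9},
    {"perception": 0.2, "prediction": 0.1, "planning": 0.1},
]

rewards = [combined_reward(scores) for scores in group]
advantages = group_relative_advantages(rewards)

for r, a in zip(rewards, advantages):
    print(f"reward={r:.2f} advantage={a:+.2f}")
```

Responses that score above the group average receive a positive advantage and are reinforced; those below receive a negative one. Because the advantage is relative to the group, no separate value network is needed, which is the usual motivation for GRPO.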

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass.

WHY NOW

Autonomous Driving AI moved forward this cycle; last verified April 2026. Public score 8.0/10. Implementation evidence is present through a linked repository.

Opportunity summary

Score 8.0

Pain A unified framework for autonomous driving that integrates perception, prediction, and planning through chain-of-thought reasoning and reinforcement learning, achieving state-of-the-art performance.

Evidence 29 refs | 4 sources | 83% coverage

Blocker no shell-level blocker reported

Competitive landscape

Segment

Autonomous Driving AI

Adoption evidence

Public code linked for build inspection

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

