ARXIV:2603.01940 · INTERACTIVE AI AGENTS · SUBMITTED 19 MAR · 21:31 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification

arXiv

CoVe offers a robust framework for generating high-quality training data for interactive tool-use agents, outperforming larger models in complex multi-turn interactions.

Blocked on Code›Score8.0Evidence unverified

Opportunity summary

Pain CoVe offers a robust framework for generating high-quality training data for interactive tool-use agents, outperforming larger models in complex multi-turn interactions.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

CoVe offers a robust framework for generating high-quality training data for interactive tool-use agents, outperforming larger models in complex multi-turn interactions. To address this gap, we introduce \textbf{CoVe} (\textbf{Co}nstraint-\textbf{Ve}rification), a post-training data synthesis framework…

METHOD

Full abstract

Developing multi-turn interactive tool-use agents is challenging because real-world user needs are often complex and ambiguous, yet agents must execute deterministic actions to satisfy them. To address this gap, we introduce \textbf{CoVe} (\textbf{Co}nstraint-\textbf{Ve}rification), a post-training data synthesis framework designed for training interactive tool-use agents while ensuring both data complexity and correctness. CoVe begins by defining explicit task constraints, which serve a dual role: they guide the generation of complex trajectories and act as deterministic verifiers for assessing trajectory quality. This enables the creation of high-quality training trajectories for supervised fine-tuning (SFT) and the derivation of accurate reward signals for reinforcement learning (RL). Our evaluation on the challenging $τ^2$-bench benchmark demonstrates the effectiveness of the framework. Notably, our compact \textbf{CoVe-4B} model achieves success rates of 43.0\% and 59.4\% in the Airline and Retail domains, respectively; its overall performance significantly outperforms strong baselines of similar scale and remains competitive with models up to $17\times$ its size. These results indicate that CoVe provides an effective and efficient pathway for synthesizing training data for state-of-the-art interactive tool-use agents. To support future research, we open-source our code, trained model, and the full set of 12K high-quality trajectories used for training.

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. This enables the creation of high-quality training trajectories for supervised fine-tuning (SFT) and the derivation of accurate reward signals for reinforcement learning (RL).

WHY NOW

Interactive AI Agents moved forward this cycle; last verified April 2026. Public score 8.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score8.0

PainCoVe offers a robust framework for generating high-quality training data for interactive tool-use agents, outperforming larger models in complex multi-turn interactions.

Evidence0 refs | 0 sources | 33% coverage

Blockermissing authors

Analysis summary

CoVe offers a robust framework for generating high-quality training data for interactive tool-use agents, outperforming larger models in complex multi-turn interactions.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

CoVe offers a robust framework for generating high-quality training data for interactive tool-use agents, outperforming larger models in complex multi-turn interactions.

Segment

Interactive AI Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "63768757-8d2b-4f6f-9eae-9ce8ee561248", "arxiv_id": "2603.01940", "canonical_route": "/paper/cove-training-interactive-tool-use-agents-via-constraint-guided-verification", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "cove-training-interactive-tool-use-agents-via-constraint-guided-verification", "endpoints": { "paper_pack": "/api/v1/paper/cove-training-interactive-tool-use-agents-via-constraint-guided-verification/paper-pack", "build_passport": "/api/v1/paper/cove-training-interactive-tool-use-agents-via-constraint-guided-verification/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification", "normalized_query": "2603.01940", "route": "/paper/cove-training-interactive-tool-use-agents-via-constraint-guided-verification", "paper_ref": "cove-training-interactive-tool-use-agents-via-constraint-guided-verification", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/cove-training-interactive-tool-use-agents-via-constraint-guided-verification#webpage", "url": "https://sciencetostartup.com/paper/cove-training-interactive-tool-use-agents-via-constraint-guided-verification", "name": "CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification", "description": "CoVe offers a robust framework for generating high-quality training data for interactive tool-use agents, outperforming larger models in complex multi-turn interactions.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/cove-training-interactive-tool-use-agents-via-constraint-guided-verification#scholarlyArticle", "headline": "CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification", "description": "CoVe offers a robust framework for generating high-quality training data for interactive tool-use agents, outperforming larger models in complex multi-turn interactions.", "url": "https://sciencetostartup.com/paper/cove-training-interactive-tool-use-agents-via-constraint-guided-verification", "sameAs": "https://arxiv.org/abs/2603.01940", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.01940" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-02T14:56:35.000Z", "author": [ { "@type": "Person", "name": "Jinpeng Chen", "affiliation": { "@type": "Organization", "name": "Huawei Research" } }, { "@type": "Person", "name": "Cheng Gong", "affiliation": { "@type": "Organization", "name": "Huawei Research" } }, { "@type": "Person", "name": "Rui Liu", "affiliation": { "@type": "Organization", "name": "Huawei Research" } }, { "@type": "Person", "name": "Hanbo Li", "affiliation": { "@type": "Organization", "name": "Independent Researcher" } } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 8 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Interactive AI Agents" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Interactive AI Agents", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "CoVe: Training Interactive Tool-Use Agents via Constraint-Gu", "item": "https://sciencetostartup.com/paper/cove-training-interactive-tool-use-agents-via-constraint-guided-verification" } ] }, { "@type": "FAQPage", "mainEntity": [ { "@type": "Question", "name": "What is the startup potential of \"CoVe: Training Interactive Tool-Use Agents via Constraint-Gu\"?", "acceptedAnswer": { "@type": "Answer", "text": "CoVe offers a robust framework for generating high-quality training data for interactive tool-use agents, outperforming larger models in complex multi-turn interactions." } }, { "@type": "Question", "name": "What products could be built from this research?", "acceptedAnswer": { "@type": "Answer", "text": "Leverage the CoVe framework to build an API that e-commerce platforms can integrate for automated smart chatbots, enhancing customer service capabilities." } }, { "@type": "Question", "name": "What are the practical use cases?", "acceptedAnswer": { "@type": "Answer", "text": "Develop a virtual customer support agent for e-commerce platforms that can handle complex, interactive queries efficiently, reducing the need for human intervention." } }, { "@type": "Question", "name": "What industries could this research disrupt?", "acceptedAnswer": { "@type": "Answer", "text": "Replaces traditional customer service algorithms that struggle with complex, multi-turn interactions and often require substantial human intervention." } } ] } ] }

Competitive landscape

CoVe offers a robust framework for generating high-quality training data for interactive tool-use agents, outperforming larger models in complex multi-turn interactions.

Segment

Interactive AI Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification

CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline