ARXIV:2603.05218 · AI FOR ENTERPRISE SEARCH · SUBMITTED 19 MAR · 18:48 UTC · FRESHNESS STALE

Verified | Source: PDF linked
Partial | PaperPack: 3 of 4 citation fields filled
Missing | Missing fields: authors
Partial | Proof: unverified proof status

KARL: Knowledge Agents via Reinforcement Learning

arXiv

KARL leverages innovative reinforcement learning for affordable, high-performance enterprise search agents.

Blocked on Code › Score 8.0 | Evidence unverified

Opportunity summary

Pain KARL leverages innovative reinforcement learning for affordable, high-performance enterprise search agents.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified

PROBLEM

KARL leverages innovative reinforcement learning for affordable, high-performance enterprise search agents. Our work makes four core contributions.

METHOD

Full abstract

We present a system for training enterprise search agents via reinforcement learning that achieves state-of-the-art performance across a diverse suite of hard-to-verify agentic search tasks. Our work makes four core contributions. First, we introduce KARLBench, a multi-capability evaluation suite spanning six distinct search regimes, including constraint-driven entity search, cross-document report synthesis, tabular numerical reasoning, exhaustive entity retrieval, procedural reasoning over technical documentation, and fact aggregation over internal enterprise notes. Second, we show that models trained across heterogeneous search behaviors generalize substantially better than those optimized for any single benchmark. Third, we develop an agentic synthesis pipeline that employs long-horizon reasoning and tool use to generate diverse, grounded, and high-quality training data, with iterative bootstrapping from increasingly capable models. Fourth, we propose a new post-training paradigm based on iterative large-batch off-policy RL that is sample efficient, robust to train-inference engine discrepancies, and naturally extends to multi-task training with out-of-distribution generalization. Compared to Claude 4.6 and GPT 5.2, KARL is Pareto-optimal on KARLBench across cost-quality and latency-quality trade-offs, including tasks that were out-of-distribution during training. With sufficient test-time compute, it surpasses the strongest closed models. These results show that tailored synthetic data in combination with multi-task reinforcement learning enables cost-efficient and high-performing knowledge agents for grounded reasoning.
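The abstract's claim that KARL is Pareto-optimal on cost-quality trade-offs can be made concrete with a small sketch. A system is Pareto-optimal if no other system is at least as cheap and at least as good while being strictly better on one of the two axes. The code below is only an illustration of that definition: the `Point` type, the system names, and every number in it are hypothetical and are not taken from the paper or from KARLBench.

```python
from typing import NamedTuple


class Point(NamedTuple):
    name: str
    cost: float     # lower is better (hypothetical units, e.g. dollars per query)
    quality: float  # higher is better (hypothetical score in [0, 1])


def dominates(a: Point, b: Point) -> bool:
    """True if a is no worse than b on both axes and strictly better on one."""
    no_worse = a.cost <= b.cost and a.quality >= b.quality
    strictly_better = a.cost < b.cost or a.quality > b.quality
    return no_worse and strictly_better


def pareto_front(points: list[Point]) -> list[Point]:
    """Return the points that no other point dominates, sorted by cost."""
    front = [p for p in points if not any(dominates(q, p) for q in points)]
    return sorted(front, key=lambda p: p.cost)


# Made-up numbers for illustration only; these are NOT results from the paper.
systems = [
    Point("agent-a", cost=0.10, quality=0.60),
    Point("agent-b", cost=0.40, quality=0.70),
    Point("agent-c", cost=0.50, quality=0.65),  # dominated by agent-b
    Point("agent-d", cost=0.90, quality=0.80),
]

front_names = [p.name for p in pareto_front(systems)]
print(front_names)
```

The same check applies to the latency-quality trade-off by swapping the cost axis for latency. Note that several systems can be Pareto-optimal at once, so the claim is that no compared model beats KARL on both axes, not that KARL is best on either axis alone.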

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. We present a system for training enterprise search agents via reinforcement learning that achieves state-of-the-art performance across a diverse suite of hard-to-verify agentic search…

WHY NOW

AI for Enterprise Search moved forward this cycle; last verified April 2026. Public score 8.0/10.

Opportunity summary

Score 8.0

Pain KARL leverages innovative reinforcement learning for affordable, high-performance enterprise search agents.

Evidence 0 refs | 0 sources | 33% coverage

Blocker missing authors

Analysis summary

KARL leverages innovative reinforcement learning for affordable, high-performance enterprise search agents.

Competitive landscape

KARL leverages innovative reinforcement learning for affordable, high-performance enterprise search agents.

Segment

AI for Enterprise Search

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified


Paper record

Title: KARL: Knowledge Agents via Reinforcement Learning
arXiv: 2603.05218 (https://arxiv.org/abs/2603.05218)
Page: https://sciencetostartup.com/paper/karl-knowledge-agents-via-reinforcement-learning
Published: 2026-03-05T14:30:25.000Z
Research domain: AI for Enterprise Search
Viability score: 8
Free to access: yes

Authors

Jonathan D. Chang, Andrew Drozdov, Shubham Toshniwal, Owen Oertell, Alexander Trott, Jacob Portes, Abhay Gupta, Pallavi Koppol, Ashutosh Baheti, Sean Kulinski, Ivan Zhou, Irene Dea, Krista Opsahl-Ong, Simon Favreau-Lessard, Sean Owen, Jose Javier Gonzalez Ortiz, Arnav Singhvi, Xabi Andrade, Cindy Wang, Kartik Sreenivasan, Sam Havens, Jialu Liu, Peyton DeNiro, Wen Sun, Michael Bendersky, Jonathan Frankle (all Databricks AI Research)

FAQ

What is the startup potential of "KARL: Knowledge Agents via Reinforcement Learning"?

KARL leverages innovative reinforcement learning for affordable, high-performance enterprise search agents.

What products could be built from this research?

Commercialize KARL as a customizable enterprise search platform that businesses can integrate into their internal systems to improve data retrieval and synthesis across complex proprietary datasets.

What are the practical use cases?

Develop on-demand enterprise-grade search agents tailored for industries heavily relying on document-based information like finance and healthcare, enhancing their internal data handling and decision-making processes.

What industries could this research disrupt?

KARL could disrupt traditional search and data retrieval systems in enterprises, replacing less efficient and non-AI-based information retrieval methods with its more advanced and adaptable system.

