ARXIV:2603.25111 · AGENTS · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

Source: PDF linked (Verified) · PaperPack: citation fields available (Verified) · Proof: unverified proof status (Partial)

SEVerA: Verified Synthesis of Self-Evolving Agents

Debangshu Banerjee · Changming Xu · Gagandeep Singh · arXiv

A framework for generating self-evolving LLM agents with formal guarantees of safety and correctness.

Blocked on Code › Score 4.0 · Evidence unverified

Opportunity summary

Pain A framework for generating self-evolving LLM agents with formal guarantees of safety and correctness.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified


PROBLEM

A framework for generating self-evolving LLM agents with formal guarantees of safety and correctness. In this paradigm, a planner LLM synthesizes an agent program that invokes parametric models, including LLMs, which are then tuned…

METHOD

Full abstract

Recent advances have shown the effectiveness of self-evolving LLM agents on tasks such as program repair and scientific discovery. In this paradigm, a planner LLM synthesizes an agent program that invokes parametric models, including LLMs, which are then tuned per task to improve performance. However, existing self-evolving agent frameworks provide no formal guarantees of safety or correctness. Because such programs are often executed autonomously on unseen inputs, this lack of guarantees raises reliability and security concerns. We formulate agentic code generation as a constrained learning problem, combining hard formal specifications with soft objectives capturing task utility. We introduce Formally Guarded Generative Models (FGGM), which allow the planner LLM to specify a formal output contract for each generative model call using first-order logic. Each FGGM call wraps the underlying model in a rejection sampler with a verified fallback, ensuring every returned output satisfies the contract for any input and parameter setting. Building on FGGM, we present SEVerA (Self-Evolving Verified Agents), a three-stage framework: Search synthesizes candidate parametric programs containing FGGM calls; Verification proves correctness with respect to hard constraints for all parameter values, reducing the problem to unconstrained learning; and Learning applies scalable gradient-based optimization, including GRPO-style fine-tuning, to improve the soft objective while preserving correctness. We evaluate SEVerA on Dafny program verification, symbolic math synthesis, and policy-compliant agentic tool use ($τ^2$-bench). Across tasks, SEVerA achieves zero constraint violations while improving performance over unconstrained and SOTA baselines, showing that formal behavioral constraints not only guarantee correctness but also steer synthesis toward higher-quality agents.
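The FGGM mechanism described in the abstract (a rejection sampler with a verified fallback) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function name `guarded_call`, the `max_tries` parameter, and the toy sorting contract are all hypothetical, and the contract here is an ordinary runtime predicate rather than a first-order-logic specification discharged by a verifier. The sketch only shows why every returned value satisfies the contract: an output is returned either because it passed the check, or because it came from a fallback that is assumed to have been proven correct ahead of time.

```python
import random
from typing import Callable, TypeVar

X = TypeVar("X")
Y = TypeVar("Y")


def guarded_call(
    model: Callable[[X], Y],
    contract: Callable[[X, Y], bool],
    fallback: Callable[[X], Y],
    x: X,
    max_tries: int = 4,  # hypothetical retry budget, not from the paper
) -> Y:
    """Rejection-sample `model` against `contract`; use `fallback` if all tries fail."""
    for _ in range(max_tries):
        y = model(x)
        if contract(x, y):
            return y  # accepted sample: contract checked at runtime
    # Assumption: `fallback` has been verified offline to satisfy the contract
    # for every input, so this branch also returns a contract-satisfying value.
    return fallback(x)


# Toy instance (illustrative only): the output must be the sorted input.
def noisy_sorter(xs):
    """Stand-in for an unreliable generative model: returns a random permutation."""
    ys = list(xs)
    random.shuffle(ys)
    return ys


def is_sorted_permutation(xs, ys):
    """Contract: ys is sorted and is a permutation of xs."""
    return list(ys) == sorted(xs)


def verified_sort(xs):
    """Stand-in for a fallback whose correctness is assumed to be proven."""
    return sorted(xs)


result = guarded_call(noisy_sorter, is_sorted_permutation, verified_sort, [3, 1, 2])
```

Whatever the model does, `result` satisfies the contract. Tuning the model can only change how often the fallback is needed, which is consistent with the abstract's point that learning can proceed without constraints once the guarded program has been verified.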

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass.

WHY NOW

Agents moved forward this cycle; last verified April 2026. Public score 4.0/10.


Opportunity summary

Score 4.0

Pain A framework for generating self-evolving LLM agents with formal guarantees of safety and correctness.

Evidence 0 refs | 0 sources | 17% coverage

Blocker no shell-level blocker reported



Published 2026-03-26 · arXiv: https://arxiv.org/abs/2603.25111 · Domain: Agents
