ARXIV:2603.18894 · MULTI-AGENT SYSTEMS · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

I Can't Believe It's Corrupt: Evaluating Corruption in Multi-Agent Governance Systems

Vedanta S P · Ponnurangam Kumaraguru · arXiv

This research evaluates the susceptibility of large language model agents to corruption within simulated multi-agent governance systems, highlighting the importance of institutional design over model identity for pre-deployment integrity.

Blocked on Code›Score3.0Evidence unverified

Opportunity summary

Pain This research evaluates the susceptibility of large language model agents to corruption within simulated multi-agent governance systems, highlighting the importance of institutional design over model identity for pre-deployment integrity.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

METHOD

Full abstract

Large language models are increasingly proposed as autonomous agents for high-stakes public workflows, yet we lack systematic evidence about whether they would follow institutional rules when granted authority. We present evidence that integrity in institutional AI should be treated as a pre-deployment requirement rather than a post-deployment assumption. We evaluate multi-agent governance simulations in which agents occupy formal governmental roles under different authority structures, and we score rule-breaking and abuse outcomes with an independent rubric-based judge across 28,112 transcript segments. While we advance this position, the core contribution is empirical: among models operating below saturation, governance structure is a stronger driver of corruption-related outcomes than model identity, with large differences across regimes and model--governance pairings. Lightweight safeguards can reduce risk in some settings but do not consistently prevent severe failures. These results imply that institutional design is a precondition for safe delegation: before real authority is assigned to LLM agents, systems should undergo stress testing under governance-like constraints with enforceable rules, auditable logs, and human oversight on high-impact actions.

RESULT

ScienceToStartup currently rates this 3.0/10 on the public viability pass. These results imply that institutional design is a precondition for safe delegation: before real authority is assigned to LLM agents, systems should undergo stress…

WHY NOW

Multi-Agent Systems moved forward this cycle; last verified April 2026. Public score 3.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score3.0

PainThis research evaluates the susceptibility of large language model agents to corruption within simulated multi-agent governance systems, highlighting the importance of institutional design over model identity for pre-deployment integrity.

Evidence0 refs | 0 sources | 17% coverage

Blockerno shell-level blocker reported

Analysis summary

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Segment

Multi-Agent Systems

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

References(15)

Talk, Judge, Cooperate: Gossip-Driven Indirect Reciprocity in Self-Interested LLM Agents

2026Shuhui Zhu, Yue Lin et al.

Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences

2025Batu El, James Zou

Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia

2023A. Vezhnevets, J. Agapiou et al.

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework

2023Qingyun Wu, Gagan Bansal et al.

Constitutional AI: Harmlessness from AI Feedback

2022Yuntao Bai, Saurav Kadavath et al.

ReAct: Synergizing Reasoning and Acting in Language Models

2022Shunyu Yao, Jeffrey Zhao et al.

Controlling Corruption

2021Bo Rothstein

Open Problems in Cooperative AI

2020Allan Dafoe, Edward Hughes et al.

Artificial Intelligence, Algorithmic Pricing, and Collusion

2020Emilio Calvano, G. Calzolari et al.

Algorithmic Regulation: A Critical Interrogation

2018K. Yeung

Multi-agent Reinforcement Learning in Sequential Social Dilemmas

2017Joel Z. Leibo, V. Zambaldi et al.

Corruption and government : causes, consequences, and reform

2016Nordoc'Archéo

Concrete Problems in AI Safety

2016Dario Amodei, Chris Olah et al.

Accountability in algorithmic decision making

2016N. Diakopoulos

Accountable Algorithms

2016J. Reidenberg, J. Reidenberg et al.

{ "contract_version": "paper-r2", "paper_id": "81c6fff6-1e80-4e93-b479-c4a5b9a71ccd", "arxiv_id": "2603.18894", "canonical_route": "/paper/i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems", "endpoints": { "paper_pack": "/api/v1/paper/i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems/paper-pack", "build_passport": "/api/v1/paper/i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "I Can't Believe It's Corrupt: Evaluating Corruption in Multi-Agent Governance Systems", "normalized_query": "2603.18894", "route": "/paper/i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems", "paper_ref": "i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems#webpage", "url": "https://sciencetostartup.com/paper/i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems", "name": "I Can't Believe It's Corrupt: Evaluating Corruption in Multi-Agent Governance Systems", "description": "This research evaluates the susceptibility of large language model agents to corruption within simulated multi-agent governance systems, highlighting the importance of institutional design over model identity for pre-deployment integrity.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems#scholarlyArticle", "headline": "I Can't Believe It's Corrupt: Evaluating Corruption in Multi-Agent Governance Systems", "description": "This research evaluates the susceptibility of large language model agents to corruption within simulated multi-agent governance systems, highlighting the importance of institutional design over model identity for pre-deployment integrity.", "url": "https://sciencetostartup.com/paper/i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems", "sameAs": "https://arxiv.org/abs/2603.18894", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.18894" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-19T13:34:54.000Z", "author": [ { "@type": "Person", "name": "Vedanta S P" }, { "@type": "Person", "name": "Ponnurangam Kumaraguru" } ], "citation": [ { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "c9c84711bb5fe104a810724ec8947ae539d7b818" }, "url": "https://www.semanticscholar.org/paper/c9c84711bb5fe104a810724ec8947ae539d7b818" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "cb492a530a982e4e0d051720cd111c6d54575275" }, "url": "https://www.semanticscholar.org/paper/cb492a530a982e4e0d051720cd111c6d54575275" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "b4798b374f5064476f838545f75569c22e682a03" }, "url": "https://www.semanticscholar.org/paper/b4798b374f5064476f838545f75569c22e682a03" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "3936fd3c6187f606c6e4e2e20b196dbc41cc4654" }, "url": "https://www.semanticscholar.org/paper/3936fd3c6187f606c6e4e2e20b196dbc41cc4654" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "99832586d55f540f603637e458a292406a0ed75d" }, "url": "https://www.semanticscholar.org/paper/99832586d55f540f603637e458a292406a0ed75d" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "fedc31d06e70658254e9cde349d563d4680e6ffd" }, "url": "https://www.semanticscholar.org/paper/fedc31d06e70658254e9cde349d563d4680e6ffd" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "2a1573cfa29a426c695e2caf6de0167a12b788ef" }, "url": "https://www.semanticscholar.org/paper/2a1573cfa29a426c695e2caf6de0167a12b788ef" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "ac1b26081b65d65888c96fa4d2e67a1930662db2" }, "url": "https://www.semanticscholar.org/paper/ac1b26081b65d65888c96fa4d2e67a1930662db2" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "887bf6e3dca2cbeca2f77edc48686d01f11417d7" }, "url": "https://www.semanticscholar.org/paper/887bf6e3dca2cbeca2f77edc48686d01f11417d7" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "d4e137eeec6ca4883df9f9cf40cc49f62e8388be" }, "url": "https://www.semanticscholar.org/paper/d4e137eeec6ca4883df9f9cf40cc49f62e8388be" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "bb896f4a753f895e90d05c538044ccf5ca1cdd11" }, "url": "https://www.semanticscholar.org/paper/bb896f4a753f895e90d05c538044ccf5ca1cdd11" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "e86f71ca2948d17b003a5f068db1ecb2b77827f7" }, "url": "https://www.semanticscholar.org/paper/e86f71ca2948d17b003a5f068db1ecb2b77827f7" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "e749658ed9354d66d4d9b3588270ea0ad2ef0687" }, "url": "https://www.semanticscholar.org/paper/e749658ed9354d66d4d9b3588270ea0ad2ef0687" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "1a4c6856292b8c64d19a812a77f0aa6fd47cb96c" }, "url": "https://www.semanticscholar.org/paper/1a4c6856292b8c64d19a812a77f0aa6fd47cb96c" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "da812fbacd6edddbbe3a53625804f32edf7ed0ea" }, "url": "https://www.semanticscholar.org/paper/da812fbacd6edddbbe3a53625804f32edf7ed0ea" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 3 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Multi-Agent Systems" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Multi-Agent Systems", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "I Can't Believe It's Corrupt: Evaluating Corruption in Multi", "item": "https://sciencetostartup.com/paper/i-can-t-believe-it-s-corrupt-evaluating-corruption-in-multi-agent-governance-systems" } ] } ] }

Competitive landscape

Segment

Multi-Agent Systems

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

References(15)

Talk, Judge, Cooperate: Gossip-Driven Indirect Reciprocity in Self-Interested LLM Agents

2026Shuhui Zhu, Yue Lin et al.

Moloch's Bargain: Emergent Misalignment When LLMs Compete for Audiences

2025Batu El, James Zou

Generative agent-based modeling with actions grounded in physical, social, or digital space using Concordia

2023A. Vezhnevets, J. Agapiou et al.

AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework

2023Qingyun Wu, Gagan Bansal et al.

Constitutional AI: Harmlessness from AI Feedback

2022Yuntao Bai, Saurav Kadavath et al.

ReAct: Synergizing Reasoning and Acting in Language Models

2022Shunyu Yao, Jeffrey Zhao et al.

Controlling Corruption

2021Bo Rothstein

Open Problems in Cooperative AI

2020Allan Dafoe, Edward Hughes et al.

Artificial Intelligence, Algorithmic Pricing, and Collusion

2020Emilio Calvano, G. Calzolari et al.

Algorithmic Regulation: A Critical Interrogation

2018K. Yeung

Multi-agent Reinforcement Learning in Sequential Social Dilemmas

2017Joel Z. Leibo, V. Zambaldi et al.

Corruption and government : causes, consequences, and reform

2016Nordoc'Archéo

Concrete Problems in AI Safety

2016Dario Amodei, Chris Olah et al.

Accountability in algorithmic decision making

2016N. Diakopoulos

Accountable Algorithms

2016J. Reidenberg, J. Reidenberg et al.

I Can't Believe It's Corrupt: Evaluating Corruption in Multi-Agent Governance Systems

I Can't Believe It's Corrupt: Evaluating Corruption in Multi-Agent Governance Systems

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(15)

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(15)

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline