ARXIV:2602.14740 · STRATEGIC AI SIMULATION · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises

arXiv

AI models simulate nuclear crisis scenarios, providing insights into strategic reasoning and its pitfalls in AI systems.

Blocked on Code›Score4.0Evidence unverified

Opportunity summary

Pain AI models simulate nuclear crisis scenarios, providing insights into strategic reasoning and its pitfalls in AI systems.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

AI models simulate nuclear crisis scenarios, providing insights into strategic reasoning and its pitfalls in AI systems. They spontaneously attempt deception, signaling intentions they do not intend to follow; they demonstrate rich theory of…

METHOD

Full abstract

Today's leading AI models engage in sophisticated behaviour when placed in strategic competition. They spontaneously attempt deception, signaling intentions they do not intend to follow; they demonstrate rich theory of mind, reasoning about adversary beliefs and anticipating their actions; and they exhibit credible metacognitive self-awareness, assessing their own strategic abilities before deciding how to act. Here we present findings from a crisis simulation in which three frontier large language models (GPT-5.2, Claude Sonnet 4, Gemini 3 Flash) play opposing leaders in a nuclear crisis. Our simulation has direct application for national security professionals, but also, via its insights into AI reasoning under uncertainty, has applications far beyond international crisis decision-making. Our findings both validate and challenge central tenets of strategic theory. We find support for Schelling's ideas about commitment, Kahn's escalation framework, and Jervis's work on misperception, inter alia. Yet we also find that the nuclear taboo is no impediment to nuclear escalation by our models; that strategic nuclear attack, while rare, does occur; that threats more often provoke counter-escalation than compliance; that high mutual credibility accelerated rather than deterred conflict; and that no model ever chose accommodation or withdrawal even when under acute pressure, only reduced levels of violence. We argue that AI simulation represents a powerful tool for strategic analysis, but only if properly calibrated against known patterns of human reasoning. Understanding how frontier models do and do not imitate human strategic logic is essential preparation for a world in which AI increasingly shapes strategic outcomes.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. They spontaneously attempt deception, signaling intentions they do not intend to follow; they demonstrate rich theory of mind, reasoning about adversary beliefs and anticipating…

WHY NOW

Strategic AI Simulation moved forward this cycle; last verified April 2026. Public score 4.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score4.0

PainAI models simulate nuclear crisis scenarios, providing insights into strategic reasoning and its pitfalls in AI systems.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

AI models simulate nuclear crisis scenarios, providing insights into strategic reasoning and its pitfalls in AI systems.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

AI models simulate nuclear crisis scenarios, providing insights into strategic reasoning and its pitfalls in AI systems.

Segment

Strategic AI Simulation

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

References(15)

An analysis of AI Decision under Risk: Prospect theory emerges in Large Language Models

2025Kenneth Payne

Strategic Intelligence in Large Language Models: Evidence from evolutionary Game Theory

2025Kenneth Payne, Baptiste Alloui-Cros

Human vs. Machine: Behavioral Differences between Expert Humans and Language Models in Wargame Simulations

2024Max Lamparth, Anthony Corso et al.

Escalation Risks from Language Models in Military and Diplomatic Decision-Making

2024Juan-Pablo Rivera, Gabriel Mukobi et al.

Playing repeated games with large language models

2023Elif Akata, L. Schulz et al.

Playing Games With GPT: What Can We Learn About a Large Language Model From Canonical Strategic Games?

2023Philip Brookins, Jason Debacker

Human-level play in the game of Diplomacy by combining language models with strategic reasoning

2022A. Bakhtin, Noam Brown et al.

No-Press Diplomacy from Scratch

2021A. Bakhtin, David J. Wu et al.

I, warbot: the dawn of artificially intelligent conflict

2021Kathryn Urban

Spiral

2021She Muses

A Stable Nuclear Future? The Impact of Autonomous Systems and Artificial Intelligence

2019Michael C. Horowitz, P. Scharre et al.

Army of none: autonomous weapons and the future of war

2018Shashank V. Joshi

Thinking fast and slow.

2014N. McGlynn

ARMS AND INFLUENCE

1967T. Schelling

The Twenty Years' Crisis, 1919-1939: An Introduction to the Study of International Relations

1942E. Borchard, E. Carr

{ "contract_version": "paper-r2", "paper_id": "2c5ce12e-f2e8-45cb-95c4-ef378d521eef", "arxiv_id": "2602.14740", "canonical_route": "/paper/ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises", "endpoints": { "paper_pack": "/api/v1/paper/ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises/paper-pack", "build_passport": "/api/v1/paper/ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises", "normalized_query": "2602.14740", "route": "/paper/ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises", "paper_ref": "ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises#webpage", "url": "https://sciencetostartup.com/paper/ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises", "name": "AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises", "description": "AI models simulate nuclear crisis scenarios, providing insights into strategic reasoning and its pitfalls in AI systems.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises#scholarlyArticle", "headline": "AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises", "description": "AI models simulate nuclear crisis scenarios, providing insights into strategic reasoning and its pitfalls in AI systems.", "url": "https://sciencetostartup.com/paper/ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises", "sameAs": "https://arxiv.org/abs/2602.14740", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2602.14740" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-02-16T13:35:01.000Z", "citation": [ { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "311db5aa5da4c2d92ff51c3c23584ed4444abd19" }, "url": "https://www.semanticscholar.org/paper/311db5aa5da4c2d92ff51c3c23584ed4444abd19" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "bc141950f522e74983ead1953295c43e34c04d44" }, "url": "https://www.semanticscholar.org/paper/bc141950f522e74983ead1953295c43e34c04d44" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "83650e37a1c446fedbf90144973e91695e814fa3" }, "url": "https://www.semanticscholar.org/paper/83650e37a1c446fedbf90144973e91695e814fa3" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "7164f67023761f0c5962bb88ffb775e725cb94de" }, "url": "https://www.semanticscholar.org/paper/7164f67023761f0c5962bb88ffb775e725cb94de" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "3f98cf521222c65522200037c0eb95a17081b2dd" }, "url": "https://www.semanticscholar.org/paper/3f98cf521222c65522200037c0eb95a17081b2dd" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "e89ed6bb1864558e3889f5f2fb8931643c633479" }, "url": "https://www.semanticscholar.org/paper/e89ed6bb1864558e3889f5f2fb8931643c633479" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "b58f6674bf54fdee0ce1bc36d536e3b1d50030ad" }, "url": "https://www.semanticscholar.org/paper/b58f6674bf54fdee0ce1bc36d536e3b1d50030ad" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "e4dc7512ac7da2b87d0f0ce2b876e7fb3e197ebb" }, "url": "https://www.semanticscholar.org/paper/e4dc7512ac7da2b87d0f0ce2b876e7fb3e197ebb" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "114e9eb8b6cc9d7bb806ff537b30ef35f4a57fe3" }, "url": "https://www.semanticscholar.org/paper/114e9eb8b6cc9d7bb806ff537b30ef35f4a57fe3" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "f63059d81d10a78fd710be3b3dd2bf63f72fdce5" }, "url": "https://www.semanticscholar.org/paper/f63059d81d10a78fd710be3b3dd2bf63f72fdce5" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "2f2961362355e45fa014ca0bb8ce4495aedf8824" }, "url": "https://www.semanticscholar.org/paper/2f2961362355e45fa014ca0bb8ce4495aedf8824" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "c4a1ff7f04858b2869298b99de53b906361b538f" }, "url": "https://www.semanticscholar.org/paper/c4a1ff7f04858b2869298b99de53b906361b538f" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "b45638db231f99db7668e29d2f12516201e5489c" }, "url": "https://www.semanticscholar.org/paper/b45638db231f99db7668e29d2f12516201e5489c" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "e913e0a24f73f21ca11858c0eabc8625c7958e4a" }, "url": "https://www.semanticscholar.org/paper/e913e0a24f73f21ca11858c0eabc8625c7958e4a" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "fea19b8d7c1fe76454646e6e1fe75076ca1a1a7e" }, "url": "https://www.semanticscholar.org/paper/fea19b8d7c1fe76454646e6e1fe75076ca1a1a7e" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 4 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Strategic AI Simulation" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Strategic AI Simulation", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "AI Arms and Influence: Frontier Models Exhibit Sophisticated", "item": "https://sciencetostartup.com/paper/ai-arms-and-influence-frontier-models-exhibit-sophisticated-reasoning-in-simulated-nuclear-crises" } ] } ] }

Competitive landscape

AI models simulate nuclear crisis scenarios, providing insights into strategic reasoning and its pitfalls in AI systems.

Segment

Strategic AI Simulation

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

References(15)

An analysis of AI Decision under Risk: Prospect theory emerges in Large Language Models

2025Kenneth Payne

Strategic Intelligence in Large Language Models: Evidence from evolutionary Game Theory

2025Kenneth Payne, Baptiste Alloui-Cros

Human vs. Machine: Behavioral Differences between Expert Humans and Language Models in Wargame Simulations

2024Max Lamparth, Anthony Corso et al.

Escalation Risks from Language Models in Military and Diplomatic Decision-Making

2024Juan-Pablo Rivera, Gabriel Mukobi et al.

Playing repeated games with large language models

2023Elif Akata, L. Schulz et al.

Playing Games With GPT: What Can We Learn About a Large Language Model From Canonical Strategic Games?

2023Philip Brookins, Jason Debacker

Human-level play in the game of Diplomacy by combining language models with strategic reasoning

2022A. Bakhtin, Noam Brown et al.

No-Press Diplomacy from Scratch

2021A. Bakhtin, David J. Wu et al.

I, warbot: the dawn of artificially intelligent conflict

2021Kathryn Urban

Spiral

2021She Muses

A Stable Nuclear Future? The Impact of Autonomous Systems and Artificial Intelligence

2019Michael C. Horowitz, P. Scharre et al.

Army of none: autonomous weapons and the future of war

2018Shashank V. Joshi

Thinking fast and slow.

2014N. McGlynn

ARMS AND INFLUENCE

1967T. Schelling

The Twenty Years' Crisis, 1919-1939: An Introduction to the Study of International Relations

1942E. Borchard, E. Carr

AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises

AI Arms and Influence: Frontier Models Exhibit Sophisticated Reasoning in Simulated Nuclear Crises

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(15)

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(15)

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline