ARXIV:2605.14218 · AI SAFETY · SUBMITTED 15 MAY · 20:13 UTC · FRESHNESS FRESH

Verified Source: PDF linked · Verified PaperPack: citation fields available · Partial Proof: unverified proof status

Fusion-fission forecasts when AI will shift to undesirable behavior

Neil F. Johnson · Frank Yingjie Huo · arXiv

A predictive model for forecasting when AI behavior will shift to undesirable outcomes, offering a real-time warning signal.

Blocked on Code · Score 4.0 · Evidence unverified

Opportunity summary

Pain: A predictive model for forecasting when AI behavior will shift to undesirable outcomes, offering a real-time warning signal.

Evidence: 0 refs | 0 sources | 0% coverage

Blocker: Evidence unverified


PROBLEM

A predictive model for forecasting when AI behavior will shift to undesirable outcomes, offering a real-time warning signal. Shifts persist in even the newest AI models despite remarkable progress in AI modeling, post-training alignment…

METHOD

Full abstract

The key problem facing ChatGPT-like AI's use across society is that its behavior can shift, unnoticed, from desirable to undesirable -- encouraging self-harm, extremist acts, financial losses, or costly medical and military mistakes -- and no one can yet predict when. Shifts persist in even the newest AI models despite remarkable progress in AI modeling, post-training alignment and safeguards. Here we show that a vector generalization of fusion-fission group dynamics observed in living and active-matter systems drives -- and can forecast -- future shifts in the AI's behavior. The shift condition, which is also derivable mathematically, results from group-level competition between the conversation-so-far (C) and the desirable (B) and undesirable (D) basin dynamics which can be estimated in advance for a given application. It is neither model-specific nor driven by stochastic sampling. We validate it across six independent tests, including: 90 percent correct across seven AI models spanning two orders of magnitude in parameter count (124M-12B); production-scale persistence across ten frontier chatbots; and a priori time-stamped prediction eleven months before the Stanford 'Delusional Spirals' corpus appeared, and independently confirmed by that corpus of 207,443 human-AI exchanges. Because it sits architecturally below the current safety stack, the same formula provides a real-time warning signal that current alignment does not supply, portable across current and future ChatGPT-like AI architectures and instantiable in application domains where competing response classes can be defined.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. Here we show that a vector generalization of fusion-fission group dynamics observed in living and active-matter systems drives -- and can forecast -- future…

WHY NOW

AI Safety moved forward this cycle; last verified May 2026. Public score 4.0/10.




Competitive landscape

A predictive model for forecasting when AI behavior will shift to undesirable outcomes, offering a real-time warning signal.

Segment: AI Safety

Adoption evidence: No public code link in the paper record yet

Commercial read: 4.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified


arXiv: https://arxiv.org/abs/2605.14218 · Published: 2026-05-14T00:26:32.000Z · Free to access

