ARXIV:2604.09104 · AI SAFETY · SUBMITTED 13 APR · 20:26 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence

Tommy Shaffer Shane · Simon Mylius · Hamish Hobbs · arXiv

A novel OSINT methodology to detect real-world AI scheming incidents by analyzing chatbot and command-line transcripts.

Ship in 2-4 weeks›Score4.0Evidence unverified

Opportunity summary

Pain A novel OSINT methodology to detect real-world AI scheming incidents by analyzing chatbot and command-line transcripts.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A novel OSINT methodology to detect real-world AI scheming incidents by analyzing chatbot and command-line transcripts. In particular, scheming evaluations demonstrate behaviours that may not occur in real-world settings, limiting scientific understanding, hindering policy…

METHOD

Full abstract

Scheming, the covert pursuit of misaligned goals by AI systems, represents a potentially catastrophic risk, yet scheming research suffers from significant limitations. In particular, scheming evaluations demonstrate behaviours that may not occur in real-world settings, limiting scientific understanding, hindering policy development, and not enabling real-time detection of loss of control incidents. Real-world evidence is needed, but current monitoring techniques are not effective for this purpose. This paper introduces a novel open-source intelligence (OSINT) methodology for detecting real-world scheming incidents: collecting and analysing transcripts from chatbot conversations or command-line interactions shared online. Analysing over 183,420 transcripts from X (formerly Twitter), we identify 698 real-world scheming-related incidents between October 2025 and March 2026. We observe a statistically significant 4.9x increase in monthly incidents from the first to last month, compared to a 1.7x increase in posts discussing scheming. We find evidence of multiple scheming-related behaviours in real-world deployments previously reported only in experiments, many resulting in real-world harms. While we did not detect catastrophic scheming incidents, the behaviours observed demonstrate concerning precursors, such as willingness to disregard instructions, circumvent safeguards, lie to users, and single-mindedly pursue goals in harmful ways. As AI systems become more capable, these could evolve into more strategic scheming with potentially catastrophic consequences. Our findings demonstrate the viability of transcript-based OSINT as a scalable approach to real-world scheming detection supporting scientific research, policy development, and emergency response. We recommend further investment towards OSINT techniques for monitoring scheming and loss of control.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. In particular, scheming evaluations demonstrate behaviours that may not occur in real-world settings, limiting scientific understanding, hindering policy development, and not enabling real-time detection…

WHY NOW

AI Safety moved forward this cycle; last verified April 2026. Public score 4.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score4.0

PainA novel OSINT methodology to detect real-world AI scheming incidents by analyzing chatbot and command-line transcripts.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A novel OSINT methodology to detect real-world AI scheming incidents by analyzing chatbot and command-line transcripts.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A novel OSINT methodology to detect real-world AI scheming incidents by analyzing chatbot and command-line transcripts.

Segment

AI Safety

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "cd314731-8000-46a0-849b-9cfdde2e3b5d", "arxiv_id": "2604.09104", "canonical_route": "/paper/scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence", "endpoints": { "paper_pack": "/api/v1/paper/scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence/paper-pack", "build_passport": "/api/v1/paper/scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence", "normalized_query": "2604.09104", "route": "/paper/scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence", "paper_ref": "scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence#webpage", "url": "https://sciencetostartup.com/paper/scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence", "name": "Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence", "description": "A novel OSINT methodology to detect real-world AI scheming incidents by analyzing chatbot and command-line transcripts.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence#scholarlyArticle", "headline": "Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence", "description": "A novel OSINT methodology to detect real-world AI scheming incidents by analyzing chatbot and command-line transcripts.", "url": "https://sciencetostartup.com/paper/scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence", "sameAs": "https://arxiv.org/abs/2604.09104", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.09104" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-10T08:37:18.000Z", "author": [ { "@type": "Person", "name": "Tommy Shaffer Shane" }, { "@type": "Person", "name": "Simon Mylius" }, { "@type": "Person", "name": "Hamish Hobbs" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 4 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "AI Safety" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "AI Safety", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Scheming in the wild: detecting real-world AI scheming incid", "item": "https://sciencetostartup.com/paper/scheming-in-the-wild-detecting-real-world-ai-scheming-incidents-with-open-source-intelligence" } ] } ] }

Competitive landscape

A novel OSINT methodology to detect real-world AI scheming incidents by analyzing chatbot and command-line transcripts.

Segment

AI Safety

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence

Scheming in the wild: detecting real-world AI scheming incidents with open-source intelligence

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline