ARXIV:2603.28499 · ADVERSARIAL DECISION MAKING WITH LLMS · SUBMITTED 31 MAR · 20:23 UTC · FRESHNESS STALE

Verified Source: PDF linked · Verified PaperPack: citation fields available · Partial Proof: unverified proof status

Next-Token Prediction and Regret Minimization

Mehryar Mohri · Clayton Sanford · Jon Schneider · Kiran Vodrahalli · Yifan Wu · arXiv

This paper explores how to make next-token prediction models robust to adversarial decision-making environments, with theoretical guarantees and empirical validation on transformer architectures.

Blocked on Code › Score 3.0 · Evidence unverified

Opportunity summary

Pain This paper explores how to make next-token prediction models robust to adversarial decision-making environments, with theoretical guarantees and empirical validation on transformer architectures.

Evidence 16 refs | 4 sources | 50% coverage

Blocker Evidence unverified


PROBLEM

This paper explores how to make next-token prediction models robust to adversarial decision-making environments, with theoretical guarantees and empirical validation on transformer architectures. Specifically, if we train a next-token prediction model on a distribution…

METHOD

Full abstract

We consider the question of how to employ next-token prediction algorithms in adversarial online decision-making environments. Specifically, if we train a next-token prediction model on a distribution $\mathcal{D}$ over sequences of opponent actions, when is it the case that the induced online decision-making algorithm (by approximately best responding to the model's predictions) has low adversarial regret (i.e., when is $\mathcal{D}$ a "low-regret distribution")? For unbounded context windows (where the prediction made by the model can depend on all the actions taken by the adversary thus far), we show that although not every distribution $\mathcal{D}$ is a low-regret distribution, every distribution $\mathcal{D}$ is exponentially close (in TV distance) to one low-regret distribution, and hence sublinear regret can always be achieved at negligible cost to the accuracy of the original next-token prediction model. In contrast to this, for bounded context windows (where the prediction made by the model can depend only on the past $w$ actions taken by the adversary, as may be the case in modern transformer architectures), we show that there are some distributions $\mathcal{D}$ of opponent play that are $\Theta(1)$-far from any low-regret distribution $\mathcal{D}'$ (even when $w = \Omega(T)$ and such distributions exist). Finally, we complement these results by showing that the unbounded context robustification procedure can be implemented by layers of a standard transformer architecture, and provide empirical evidence that transformer models can be efficiently trained to represent these new low-regret distributions.
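To make the abstract's setup concrete, here is a minimal Python sketch of an online learner that best responds to a next-action predictor, together with the usual external-regret computation. Everything in it is an illustrative assumption rather than the paper's construction: the names `best_response`, `simulate`, `external_regret`, and `empirical_predictor` are hypothetical, the smoothed empirical-frequency predictor merely stands in for a trained next-token model, and the adversary is a simple one that always picks the action worst for the learner's current move. The paper's robustification procedure is not implemented here.

```python
from typing import Callable, List, Sequence, Tuple

import numpy as np

# Hypothetical type: maps the opponent's action history to a
# probability distribution over the opponent's next action.
Predictor = Callable[[Sequence[int]], np.ndarray]


def empirical_predictor(n_actions: int) -> Predictor:
    """Stand-in for a next-token model: smoothed empirical frequencies."""
    def predict(history: Sequence[int]) -> np.ndarray:
        counts = np.ones(n_actions)  # add-one smoothing (an assumption)
        for a in history:
            counts[a] += 1
        return counts / counts.sum()
    return predict


def best_response(pred: np.ndarray, utility: np.ndarray) -> int:
    """Learner action maximizing expected utility under the prediction.

    utility[i, j] is the learner's payoff for playing i against j.
    Ties are broken toward the lowest index (np.argmax behavior).
    """
    return int(np.argmax(utility @ pred))


def simulate(predictor: Predictor, utility: np.ndarray,
             horizon: int) -> Tuple[List[int], List[int]]:
    """Play `horizon` rounds against an adaptive worst-case opponent."""
    learner, opponent = [], []
    for _ in range(horizon):
        a = best_response(predictor(opponent), utility)
        b = int(np.argmin(utility[a]))  # opponent minimizes learner payoff
        learner.append(a)
        opponent.append(b)
    return learner, opponent


def external_regret(utility: np.ndarray, learner: Sequence[int],
                    opponent: Sequence[int]) -> float:
    """Best fixed action in hindsight minus the learner's realized utility."""
    opp = np.asarray(opponent)
    realized = utility[np.asarray(learner), opp].sum()
    best_fixed = utility[:, opp].sum(axis=1).max()
    return float(best_fixed - realized)


# Matching-pennies-style game: the learner earns 1 when it matches.
U = np.eye(2)
learner_actions, opponent_actions = simulate(empirical_predictor(2), U, 100)
regret = external_regret(U, learner_actions, opponent_actions)
```

In this toy run the deterministic best-responder is mismatched in every round, while either fixed action would have matched half the time, so the regret is half the horizon. This is only a small illustration of why naive best response to a predictor need not have low adversarial regret; it says nothing about how the paper modifies the distribution to fix it.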

RESULT

ScienceToStartup currently rates this 3.0/10 on the public viability pass. For unbounded context windows (where the prediction made by the model can depend on all the actions taken by the adversary thus far), we…

WHY NOW

Adversarial Decision Making with LLMs moved forward this cycle; last verified April 2026. Public score 3.0/10.


Opportunity summary

Score 3.0

Pain This paper explores how to make next-token prediction models robust to adversarial decision-making environments, with theoretical guarantees and empirical validation on transformer architectures.

Evidence 16 refs | 4 sources | 50% coverage

Blocker no shell-level blocker reported


Competitive landscape

This paper explores how to make next-token prediction models robust to adversarial decision-making environments, with theoretical guarantees and empirical validation on transformer architectures.

Segment: Adversarial Decision Making with LLMs

Adoption evidence: No public code link in the paper record yet

Commercial read: 3.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified

