ARXIV:2606.06486 · ADAPTIVE DECISION MAKING · SUBMITTED 06 JUN · 03:17 UTC · FRESHNESS FRESH

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Regret Minimization with Adaptive Opponents in Repeated Games

Mingyang Liu · Asuman Ozdaglar · Tiancheng Yu · Kaiqing Zhang · arXiv

Developing algorithms for minimizing regret in adaptive repeated games.

Blocked on Code›Score2.0Evidence unverified

Opportunity summary

Pain Developing algorithms for minimizing regret in adaptive repeated games.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Developing algorithms for minimizing regret in adaptive repeated games. The standard metric of \emph{external regret} in online learning is known to fail to capture such adaptivity.

METHOD

Full abstract

In this paper, we study regret minimization in repeated games with \emph{adaptive} opponents who can respond based on histories of play. The standard metric of \emph{external regret} in online learning is known to fail to capture such adaptivity. To account for players' counterfactual reasoning, we introduce {\tt Repeated Policy Regret (RP-Regret)}, a game-theoretic metric that measures the difference between the \emph{realized} and the \emph{best-in-hindsight} accumulated utility when all players can \emph{respond} to the history of play. Compared to existing regret notions in this setting, ours is native to repeated game playing, enabling stronger comparators and opponents with fewer constraints, while maintaining the possibility of finding better equilibria when all players minimize it. We first identify necessary conditions for obtaining {\tt RP-Regret} sublinear in time, on the variation of the player's comparator strategies in the regret definition and on the memories of both the comparator and opponents' strategies. We then study additional conditions and provable algorithms to minimize {\tt RP-Regret}, which is by definition \emph{non-convex} in the strategy space. To address this challenge, we propose three algorithms: (i) one based on an optimization oracle, as assumed in some prior work in online non-convex learning; (ii) one that minimizes a convex and \emph{linearized} surrogate of {\tt RP-Regret} at each iteration; (iii) one that directly minimizes {\tt RP-Regret} when opponents change strategies slowly. Furthermore, when all players can run algorithms to minimize the {\tt RP-Regret} (or its linearized variant), certain subgame perfect equilibria of the repeated game can be learned. We also provide experiments showing that minimizing our regret notions can lead to more cooperative solutions with higher utility in games such as Stag-Hunt.

RESULT

ScienceToStartup currently rates this 2.0/10 on the public viability pass. We also provide experiments showing that minimizing our regret notions can lead to more cooperative solutions with higher utility in games such as Stag-Hunt.

WHY NOW

Adaptive Decision Making moved forward this cycle; last verified June 2026. Public score 2.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score2.0

PainDeveloping algorithms for minimizing regret in adaptive repeated games.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

Developing algorithms for minimizing regret in adaptive repeated games.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

{ "contract_version": "paper-r2", "paper_id": "7437538b-15e5-4847-8f80-32689801443c", "arxiv_id": "2606.06486", "canonical_route": "/paper/regret-minimization-with-adaptive-opponents-in-repeated-games", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "regret-minimization-with-adaptive-opponents-in-repeated-games", "endpoints": { "paper_pack": "/api/v1/paper/regret-minimization-with-adaptive-opponents-in-repeated-games/paper-pack", "build_passport": "/api/v1/paper/regret-minimization-with-adaptive-opponents-in-repeated-games/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Regret Minimization with Adaptive Opponents in Repeated Games", "normalized_query": "2606.06486", "route": "/paper/regret-minimization-with-adaptive-opponents-in-repeated-games", "paper_ref": "regret-minimization-with-adaptive-opponents-in-repeated-games", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/regret-minimization-with-adaptive-opponents-in-repeated-games#webpage", "url": "https://sciencetostartup.com/paper/regret-minimization-with-adaptive-opponents-in-repeated-games", "name": "Regret Minimization with Adaptive Opponents in Repeated Games", "description": "Developing algorithms for minimizing regret in adaptive repeated games.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/regret-minimization-with-adaptive-opponents-in-repeated-games#scholarlyArticle", "headline": "Regret Minimization with Adaptive Opponents in Repeated Games", "description": "Developing algorithms for minimizing regret in adaptive repeated games.", "url": "https://sciencetostartup.com/paper/regret-minimization-with-adaptive-opponents-in-repeated-games", "sameAs": "https://arxiv.org/abs/2606.06486", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2606.06486" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-06-04T17:59:08.000Z", "author": [ { "@type": "Person", "name": "Mingyang Liu" }, { "@type": "Person", "name": "Asuman Ozdaglar" }, { "@type": "Person", "name": "Tiancheng Yu" }, { "@type": "Person", "name": "Kaiqing Zhang" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 2 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Adaptive Decision Making" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Adaptive Decision Making", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Regret Minimization with Adaptive Opponents in Repeated Game", "item": "https://sciencetostartup.com/paper/regret-minimization-with-adaptive-opponents-in-repeated-games" } ] }, { "@type": "FAQPage", "mainEntity": [ { "@type": "Question", "name": "What is the startup potential of \"Regret Minimization with Adaptive Opponents in Repeated Game\"?", "acceptedAnswer": { "@type": "Answer", "text": "Developing algorithms for minimizing regret in adaptive repeated games." } }, { "@type": "Question", "name": "What products could be built from this research?", "acceptedAnswer": { "@type": "Answer", "text": "To productize, develop a software solution integrating these algorithms into decision-making platforms used by financial analysts and traders." } }, { "@type": "Question", "name": "What are the practical use cases?", "acceptedAnswer": { "@type": "Answer", "text": "Automated trading systems that adapt to changing market conditions and strategic actions by competitors." } }, { "@type": "Question", "name": "What industries could this research disrupt?", "acceptedAnswer": { "@type": "Answer", "text": "It could potentially replace static or purely historical-data based decision models in real-time trading platforms." } } ] } ] }

Regret Minimization with Adaptive Opponents in Repeated Games

Regret Minimization with Adaptive Opponents in Repeated Games

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline