ARXIV:2603.28198 · ONLINE LEARNING · SUBMITTED 31 MAR · 20:18 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Policy-Controlled Generalized Share: A General Framework with a Transformer Instantiation for Strictly Online Switching-Oracle Tracking

Hongkai Hu · arXiv

A novel online learning framework with a Transformer controller that adaptively tracks switching experts, outperforming existing methods on dynamic regret benchmarks.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A novel online learning framework with a Transformer controller that adaptively tracks switching experts, outperforming existing methods on dynamic regret benchmarks.

Evidence 37 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A novel online learning framework with a Transformer controller that adaptively tracks switching experts, outperforming existing methods on dynamic regret benchmarks. We study Policy-Controlled Generalized Share (PCGS), a general strictly online framework in which…

METHOD

Full abstract

Static regret to a single expert is often the wrong target for strictly online prediction under non-stationarity, where the best expert may switch repeatedly over time. We study Policy-Controlled Generalized Share (PCGS), a general strictly online framework in which the generalized-share recursion is fixed while the post-loss update controls are allowed to vary adaptively. Its principal instantiation in this paper is PCGS-TF, which uses a causal Transformer as an update controller: after round t finishes and the loss vector is observed, the Transformer outputs the controls that map w_t to w_{t+1} without altering the already committed decision w_t. Under admissible post-loss update controls, we obtain a pathwise weighted regret guarantee for general time-varying learning rates, and a standard dynamic-regret guarantee against any expert path with at most S switches under the constant-learning-rate specialization. Empirically, on a controlled synthetic suite with exact dynamic-programming switching-oracle evaluation, PCGS-TF attains the lowest mean dynamic regret in all seven non-stationary families, with its advantage increasing for larger expert pools. On a reproduced household-electricity benchmark, PCGS-TF also achieves the lowest normalized dynamic regret for S = 5, 10, and 20.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. On a reproduced household-electricity benchmark, PCGS-TF also achieves the lowest normalized dynamic regret for S = 5, 10, and 20. Code availability is flagged…

WHY NOW

Online Learning moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA novel online learning framework with a Transformer controller that adaptively tracks switching experts, outperforming existing methods on dynamic regret benchmarks.

Evidence37 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A novel online learning framework with a Transformer controller that adaptively tracks switching experts, outperforming existing methods on dynamic regret benchmarks.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A novel online learning framework with a Transformer controller that adaptively tracks switching experts, outperforming existing methods on dynamic regret benchmarks.

Segment

Online Learning

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "7ceca69f-b957-43ec-89bd-5a5552356f08", "arxiv_id": "2603.28198", "canonical_route": "/paper/policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o", "endpoints": { "paper_pack": "/api/v1/paper/policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o/paper-pack", "build_passport": "/api/v1/paper/policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Policy-Controlled Generalized Share: A General Framework with a Transformer Instantiation for Strictly Online Switching-Oracle Tracking", "normalized_query": "2603.28198", "route": "/paper/policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o", "paper_ref": "policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o#webpage", "url": "https://sciencetostartup.com/paper/policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o", "name": "Policy-Controlled Generalized Share: A General Framework with a Transformer Instantiation for Strictly Online Switching-Oracle Tracking", "description": "A novel online learning framework with a Transformer controller that adaptively tracks switching experts, outperforming existing methods on dynamic regret benchmarks.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o#scholarlyArticle", "headline": "Policy-Controlled Generalized Share: A General Framework with a Transformer Instantiation for Strictly Online Switching-Oracle Tracking", "description": "A novel online learning framework with a Transformer controller that adaptively tracks switching experts, outperforming existing methods on dynamic regret benchmarks.", "url": "https://sciencetostartup.com/paper/policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o", "sameAs": "https://arxiv.org/abs/2603.28198", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.28198" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-30T09:07:10.000Z", "author": [ { "@type": "Person", "name": "Hongkai Hu" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Online Learning" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Online Learning", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Policy-Controlled Generalized Share: A General Framework wit", "item": "https://sciencetostartup.com/paper/policy-controlled-generalized-share-a-general-framework-with-a-transformer-instantiation-for-strictly-online-switching-o" } ] } ] }

Competitive landscape

A novel online learning framework with a Transformer controller that adaptively tracks switching experts, outperforming existing methods on dynamic regret benchmarks.

Segment

Online Learning

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Policy-Controlled Generalized Share: A General Framework with a Transformer Instantiation for Strictly Online Switching-Oracle Tracking

Policy-Controlled Generalized Share: A General Framework with a Transformer Instantiation for Strictly Online Switching-Oracle Tracking

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline