ARXIV:2603.28074 · REINFORCEMENT LEARNING CONTROL · SUBMITTED 31 MAR · 20:23 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Koopman-based surrogate modeling for reinforcement-learning-control of Rayleigh-Benard convection

Tim Plotzki · Sebastian Peitz · arXiv

Accelerate reinforcement learning control of fluid dynamics systems by using surrogate models trained with policy-aware data, reducing training time by over 40% while maintaining state-of-the-art performance.

Ship in 2-4 weeks›Score4.0Evidence unverified

Opportunity summary

Pain Accelerate reinforcement learning control of fluid dynamics systems by using surrogate models trained with policy-aware data, reducing training time by over 40% while maintaining state-of-the-art performance.

Evidence 31 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

METHOD

Full abstract

Training reinforcement learning (RL) agents to control fluid dynamics systems is computationally expensive due to the high cost of direct numerical simulations (DNS) of the governing equations. Surrogate models offer a promising alternative by approximating the dynamics at a fraction of the computational cost, but their feasibility as training environments for RL is limited by distribution shifts, as policies induce state distributions not covered by the surrogate training data. In this work, we investigate the use of Linear Recurrent Autoencoder Networks (LRANs) for accelerating RL-based control of 2D Rayleigh-Bénard convection. We evaluate two training strategies: a surrogate trained on precomputed data generated with random actions, and a policy-aware surrogate trained iteratively using data collected from an evolving policy. Our results show that while surrogate-only training leads to reduced control performance, combining surrogates with DNS in a pretraining scheme recovers state-of-the-art performance while reducing training time by more than 40%. We demonstrate that policy-aware training mitigates the effects of distribution shift, enabling more accurate predictions in policy-relevant regions of the state space.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. Our results show that while surrogate-only training leads to reduced control performance, combining surrogates with DNS in a pretraining scheme recovers state-of-the-art performance while…

WHY NOW

Reinforcement Learning Control moved forward this cycle; last verified April 2026. Public score 4.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score4.0

PainAccelerate reinforcement learning control of fluid dynamics systems by using surrogate models trained with policy-aware data, reducing training time by over 40% while maintaining state-of-the-art performance.

Evidence31 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Segment

Reinforcement Learning Control

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "30ad123f-2244-40bf-97be-9ed79bd021e4", "arxiv_id": "2603.28074", "canonical_route": "/paper/koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection", "endpoints": { "paper_pack": "/api/v1/paper/koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection/paper-pack", "build_passport": "/api/v1/paper/koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Koopman-based surrogate modeling for reinforcement-learning-control of Rayleigh-Benard convection", "normalized_query": "2603.28074", "route": "/paper/koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection", "paper_ref": "koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection#webpage", "url": "https://sciencetostartup.com/paper/koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection", "name": "Koopman-based surrogate modeling for reinforcement-learning-control of Rayleigh-Benard convection", "description": "Accelerate reinforcement learning control of fluid dynamics systems by using surrogate models trained with policy-aware data, reducing training time by over 40% while maintaining state-of-the-art performance.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection#scholarlyArticle", "headline": "Koopman-based surrogate modeling for reinforcement-learning-control of Rayleigh-Benard convection", "description": "Accelerate reinforcement learning control of fluid dynamics systems by using surrogate models trained with policy-aware data, reducing training time by over 40% while maintaining state-of-the-art performance.", "url": "https://sciencetostartup.com/paper/koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection", "sameAs": "https://arxiv.org/abs/2603.28074", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.28074" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-30T06:23:03.000Z", "author": [ { "@type": "Person", "name": "Tim Plotzki" }, { "@type": "Person", "name": "Sebastian Peitz" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 4 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Reinforcement Learning Control" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Reinforcement Learning Control", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Koopman-based surrogate modeling for reinforcement-learning-", "item": "https://sciencetostartup.com/paper/koopman-based-surrogate-modeling-for-reinforcement-learning-control-of-rayleigh-benard-convection" } ] } ] }

Competitive landscape

Segment

Reinforcement Learning Control

Adoption evidence

No public code link in the paper record yet

Commercial read

4.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Koopman-based surrogate modeling for reinforcement-learning-control of Rayleigh-Benard convection

Koopman-based surrogate modeling for reinforcement-learning-control of Rayleigh-Benard convection

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline