ARXIV:2603.26464 · REINFORCEMENT LEARNING · SUBMITTED 30 MAR · 23:58 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Automatic feature identification in least-squares policy iteration using the Koopman operator framework

Christian Mugisho Zagabe · Sebastian Peitz · arXiv

A novel reinforcement learning algorithm that automatically learns features for policy iteration, removing the need for manual feature engineering.

Blocked on Code›Score3.0Evidence unverified

Opportunity summary

Pain A novel reinforcement learning algorithm that automatically learns features for policy iteration, removing the need for manual feature engineering.

Evidence 14 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A novel reinforcement learning algorithm that automatically learns features for policy iteration, removing the need for manual feature engineering. The KAE-LSPI algorithm is based on reformulating the so-called least-squares fixed-point approximation method in terms…

METHOD

Full abstract

In this paper, we present a Koopman autoencoder-based least-squares policy iteration (KAE-LSPI) algorithm in reinforcement learning (RL). The KAE-LSPI algorithm is based on reformulating the so-called least-squares fixed-point approximation method in terms of extended dynamic mode decomposition (EDMD), thereby enabling automatic feature learning via the Koopman autoencoder (KAE) framework. The approach is motivated by the lack of a systematic choice of features or kernels in linear RL techniques. We compare the KAE-LSPI algorithm with two previous works, the classical least-squares policy iteration (LSPI) and the kernel-based least-squares policy iteration (KLSPI), using stochastic chain walk and inverted pendulum control problems as examples. Unlike previous works, no features or kernels need to be fixed a priori in our approach. Empirical results show the number of features learned by the KAE technique remains reasonable compared to those fixed in the classical LSPI algorithm. The convergence to an optimal or a near-optimal policy is also comparable to the other two methods.

RESULT

ScienceToStartup currently rates this 3.0/10 on the public viability pass. Empirical results show the number of features learned by the KAE technique remains reasonable compared to those fixed in the classical LSPI algorithm.

WHY NOW

Reinforcement Learning moved forward this cycle; last verified April 2026. Public score 3.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score3.0

PainA novel reinforcement learning algorithm that automatically learns features for policy iteration, removing the need for manual feature engineering.

Evidence14 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A novel reinforcement learning algorithm that automatically learns features for policy iteration, removing the need for manual feature engineering.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A novel reinforcement learning algorithm that automatically learns features for policy iteration, removing the need for manual feature engineering.

Segment

Reinforcement Learning

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "4f48f8bb-b06a-473c-8dc3-09354b930c5e", "arxiv_id": "2603.26464", "canonical_route": "/paper/automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework", "endpoints": { "paper_pack": "/api/v1/paper/automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework/paper-pack", "build_passport": "/api/v1/paper/automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Automatic feature identification in least-squares policy iteration using the Koopman operator framework", "normalized_query": "2603.26464", "route": "/paper/automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework", "paper_ref": "automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework#webpage", "url": "https://sciencetostartup.com/paper/automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework", "name": "Automatic feature identification in least-squares policy iteration using the Koopman operator framework", "description": "A novel reinforcement learning algorithm that automatically learns features for policy iteration, removing the need for manual feature engineering.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework#scholarlyArticle", "headline": "Automatic feature identification in least-squares policy iteration using the Koopman operator framework", "description": "A novel reinforcement learning algorithm that automatically learns features for policy iteration, removing the need for manual feature engineering.", "url": "https://sciencetostartup.com/paper/automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework", "sameAs": "https://arxiv.org/abs/2603.26464", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.26464" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-27T14:31:31.000Z", "author": [ { "@type": "Person", "name": "Christian Mugisho Zagabe" }, { "@type": "Person", "name": "Sebastian Peitz" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 3 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Reinforcement Learning" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Reinforcement Learning", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Automatic feature identification in least-squares policy ite", "item": "https://sciencetostartup.com/paper/automatic-feature-identification-in-least-squares-policy-iteration-using-the-koopman-operator-framework" } ] } ] }

Competitive landscape

A novel reinforcement learning algorithm that automatically learns features for policy iteration, removing the need for manual feature engineering.

Segment

Reinforcement Learning

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Automatic feature identification in least-squares policy iteration using the Koopman operator framework

Automatic feature identification in least-squares policy iteration using the Koopman operator framework

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline