ARXIV:2604.12237 · DRUG DISCOVERY AI · SUBMITTED 15 APR · 17:00 UTC · FRESHNESS STALE

Verified · Source: PDF linked · Verified · PaperPack: citation fields available · Partial · Proof: unverified proof status

MolMem: Memory-Augmented Agentic Reinforcement Learning for Sample-Efficient Molecular Optimization

Ziqing Wang · Yibo Wen · Abhishek Pandy · Han Liu · Kaize Ding · arXiv

A memory-augmented reinforcement learning agent for sample-efficient molecular optimization in drug discovery.

Ship in 2-4 weeks · Score 8.0 · Evidence unverified

Opportunity summary

Pain: A memory-augmented reinforcement learning agent for sample-efficient molecular optimization in drug discovery.

Evidence: 0 refs | 4 sources | 67% coverage

Blocker: Evidence unverified

PROBLEM

Molecular optimization iteratively refines a lead compound to improve its molecular properties while preserving structural similarity to the original molecule. Each oracle evaluation is expensive, so sample efficiency under a limited oracle budget is a key challenge for existing methods.
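The oracle-budget constraint can be made concrete with a small wrapper that counts evaluations and refuses to score once the budget is spent. This is a minimal sketch, not code from the MolMem repository: the names BudgetedOracle, BudgetExhausted, and toy_score are hypothetical, the scoring function is a placeholder rather than a real molecular property, and not charging repeated molecules against the budget is an assumption of this sketch.

```python
from typing import Callable, Dict


class BudgetExhausted(RuntimeError):
    """Raised when a new evaluation is requested after the budget is spent."""


class BudgetedOracle:
    """Wraps a scoring function and charges each distinct molecule once."""

    def __init__(self, score_fn: Callable[[str], float], budget: int = 500) -> None:
        self.score_fn = score_fn
        self.budget = budget
        self.calls = 0
        self._cache: Dict[str, float] = {}

    def __call__(self, molecule: str) -> float:
        # Assumption: a molecule that was already scored is served from the
        # cache and does not consume another oracle call.
        if molecule in self._cache:
            return self._cache[molecule]
        if self.calls >= self.budget:
            raise BudgetExhausted(f"oracle budget of {self.budget} calls is spent")
        self.calls += 1
        score = self.score_fn(molecule)
        self._cache[molecule] = score
        return score

    @property
    def remaining(self) -> int:
        return self.budget - self.calls


def toy_score(smiles: str) -> float:
    # Placeholder objective: fraction of 'C' characters in the string.
    # It stands in for an expensive property oracle and has no chemical meaning.
    return smiles.count("C") / max(len(smiles), 1)


if __name__ == "__main__":
    oracle = BudgetedOracle(toy_score, budget=500)
    oracle("CCO")
    oracle("CCO")  # cached, not charged again
    print(oracle.calls, oracle.remaining)
```

Any optimizer evaluated under a fixed budget, such as the 500 calls quoted in the abstract, would route every property evaluation through a counter like this so that methods are compared on equal footing.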

METHOD

Full abstract

In drug discovery, molecular optimization aims to iteratively refine a lead compound to improve molecular properties while preserving structural similarity to the original molecule. However, each oracle evaluation is expensive, making sample efficiency a key challenge for existing methods under a limited oracle budget. Trial-and-error approaches require many oracle calls, while methods that leverage external knowledge tend to reuse familiar templates and struggle on challenging objectives. A key missing piece is long-term memory that can ground decisions and provide reusable insights for future optimizations. To address this, we present MolMem (Molecular optimization with Memory), a multi-turn agentic reinforcement learning (RL) framework with a dual-memory system. Specifically, MolMem uses Static Exemplar Memory to retrieve relevant exemplars for cold-start grounding, and Evolving Skill Memory to distill successful trajectories into reusable strategies. Built on this memory-augmented formulation, we train the policy with dense step-wise rewards, turning costly rollouts into long-term knowledge that improves future optimization. Extensive experiments show that MolMem achieves 90% success on single-property tasks (1.5× over the best baseline) and 52% on multi-property tasks using only 500 oracle calls. Our code is available at https://github.com/REAL-Lab-NU/MolMem.
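The dual-memory idea in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the MolMem code: the class and function names are hypothetical, similarity is a character-bigram Tanimoto score standing in for a real fingerprint similarity, a "skill" is just a text note stored with its score gain, and the step-wise reward is taken to be the score improvement between consecutive turns.

```python
from dataclasses import dataclass, field
from typing import List, Set, Tuple


def bigrams(smiles: str) -> Set[str]:
    """Character bigrams of a SMILES string (a crude stand-in for a fingerprint)."""
    return {smiles[i:i + 2] for i in range(len(smiles) - 1)}


def tanimoto(a: str, b: str) -> float:
    """Tanimoto (Jaccard) similarity between the two bigram sets."""
    sa, sb = bigrams(a), bigrams(b)
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


@dataclass
class StaticExemplarMemory:
    """Fixed pool of exemplars, queried by similarity for cold-start grounding."""
    exemplars: List[str]

    def retrieve(self, query: str, k: int = 2) -> List[str]:
        ranked = sorted(self.exemplars, key=lambda s: tanimoto(query, s), reverse=True)
        return ranked[:k]


@dataclass
class EvolvingSkillMemory:
    """Grows over time: keeps a note for each trajectory that improved enough."""
    skills: List[Tuple[str, float]] = field(default_factory=list)

    def distill(self, description: str, scores: List[float], threshold: float) -> bool:
        # Assumption: a trajectory counts as successful when its total score
        # gain (last score minus first score) reaches the threshold.
        gain = scores[-1] - scores[0]
        if gain >= threshold:
            self.skills.append((description, gain))
            return True
        return False

    def top(self, k: int = 3) -> List[str]:
        ranked = sorted(self.skills, key=lambda item: item[1], reverse=True)
        return [description for description, _ in ranked[:k]]


def stepwise_rewards(scores: List[float]) -> List[float]:
    """Dense reward per turn, assumed here to be the change in oracle score."""
    return [later - earlier for earlier, later in zip(scores, scores[1:])]


if __name__ == "__main__":
    exemplar_memory = StaticExemplarMemory(["CCO", "c1ccccc1", "CCN"])
    print(exemplar_memory.retrieve("CCCO", k=1))
    print(stepwise_rewards([0.2, 0.5, 0.4]))
```

Under this reward definition the per-turn rewards telescope, so their sum equals the final score minus the initial score, while still giving the policy a signal at every turn. The reward design actually used in the paper may differ.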

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. The abstract reports 90% success on single-property tasks (1.5× over the best baseline) and 52% on multi-property tasks using only 500 oracle calls.

WHY NOW

Drug Discovery AI moved forward this cycle; last verified April 2026. Public score 8.0/10. Implementation evidence is present through a linked repository (https://github.com/REAL-Lab-NU/MolMem).

Opportunity summary

Blocker: no shell-level blocker reported

Competitive landscape

Segment: Drug Discovery AI

Adoption evidence: Public code linked for build inspection

Commercial read: 8.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified
