ARXIV:2604.11165 · CLINICAL DECISION SUPPORT · SUBMITTED 14 APR · 16:51 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Cost-optimal Sequential Testing via Doubly Robust Q-learning

Doudou Zhou · Yiran Zhang · Dian Jin · Yingye Zheng · Lu Tian · Tianxi Cai · arXiv

This paper introduces a doubly robust Q-learning framework for learning cost-optimal sequential testing policies from retrospective data in clinical decision-making.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain This paper introduces a doubly robust Q-learning framework for learning cost-optimal sequential testing policies from retrospective data in clinical decision-making.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

This paper introduces a doubly robust Q-learning framework for learning cost-optimal sequential testing policies from retrospective data in clinical decision-making. We study the problem of learning cost-optimal sequential decision policies from retrospective data, where…

METHOD

Full abstract

Clinical decision-making often involves selecting tests that are costly, invasive, or time-consuming, motivating individualized, sequential strategies for what to measure and when to stop ascertaining. We study the problem of learning cost-optimal sequential decision policies from retrospective data, where test availability depends on prior results, inducing informative missingness. Under a sequential missing-at-random mechanism, we develop a doubly robust Q-learning framework for estimating optimal policies. The method introduces path-specific inverse probability weights that account for heterogeneous test trajectories and satisfy a normalization property conditional on the observed history. By combining these weights with auxiliary contrast models, we construct orthogonal pseudo-outcomes that enable unbiased policy learning when either the acquisition model or the contrast model is correctly specified. We establish oracle inequalities for the stage-wise contrast estimators, along with convergence rates, regret bounds, and misclassification rates for the learned policy. Simulations demonstrate improved cost-adjusted performance over weighted and complete-case baselines, and an application to a prostate cancer cohort study illustrates how the method reduces testing cost without compromising predictive accuracy.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. We study the problem of learning cost-optimal sequential decision policies from retrospective data, where test availability depends on prior results, inducing informative missingness. Code…

WHY NOW

Clinical Decision Support moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainThis paper introduces a doubly robust Q-learning framework for learning cost-optimal sequential testing policies from retrospective data in clinical decision-making.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

This paper introduces a doubly robust Q-learning framework for learning cost-optimal sequential testing policies from retrospective data in clinical decision-making.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

This paper introduces a doubly robust Q-learning framework for learning cost-optimal sequential testing policies from retrospective data in clinical decision-making.

Segment

Clinical Decision Support

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "2232bd60-1c4d-4c6e-9c38-2b9ba60e9eda", "arxiv_id": "2604.11165", "canonical_route": "/paper/cost-optimal-sequential-testing-via-doubly-robust-q-learning", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "cost-optimal-sequential-testing-via-doubly-robust-q-learning", "endpoints": { "paper_pack": "/api/v1/paper/cost-optimal-sequential-testing-via-doubly-robust-q-learning/paper-pack", "build_passport": "/api/v1/paper/cost-optimal-sequential-testing-via-doubly-robust-q-learning/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Cost-optimal Sequential Testing via Doubly Robust Q-learning", "normalized_query": "2604.11165", "route": "/paper/cost-optimal-sequential-testing-via-doubly-robust-q-learning", "paper_ref": "cost-optimal-sequential-testing-via-doubly-robust-q-learning", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/cost-optimal-sequential-testing-via-doubly-robust-q-learning#webpage", "url": "https://sciencetostartup.com/paper/cost-optimal-sequential-testing-via-doubly-robust-q-learning", "name": "Cost-optimal Sequential Testing via Doubly Robust Q-learning", "description": "This paper introduces a doubly robust Q-learning framework for learning cost-optimal sequential testing policies from retrospective data in clinical decision-making.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/cost-optimal-sequential-testing-via-doubly-robust-q-learning#scholarlyArticle", "headline": "Cost-optimal Sequential Testing via Doubly Robust Q-learning", "description": "This paper introduces a doubly robust Q-learning framework for learning cost-optimal sequential testing policies from retrospective data in clinical decision-making.", "url": "https://sciencetostartup.com/paper/cost-optimal-sequential-testing-via-doubly-robust-q-learning", "sameAs": "https://arxiv.org/abs/2604.11165", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.11165" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-13T08:26:27.000Z", "author": [ { "@type": "Person", "name": "Doudou Zhou" }, { "@type": "Person", "name": "Yiran Zhang" }, { "@type": "Person", "name": "Dian Jin" }, { "@type": "Person", "name": "Yingye Zheng" }, { "@type": "Person", "name": "Lu Tian" }, { "@type": "Person", "name": "Tianxi Cai" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Clinical Decision Support" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Clinical Decision Support", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Cost-optimal Sequential Testing via Doubly Robust Q-learning", "item": "https://sciencetostartup.com/paper/cost-optimal-sequential-testing-via-doubly-robust-q-learning" } ] } ] }

Competitive landscape

This paper introduces a doubly robust Q-learning framework for learning cost-optimal sequential testing policies from retrospective data in clinical decision-making.

Segment

Clinical Decision Support

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Cost-optimal Sequential Testing via Doubly Robust Q-learning

Cost-optimal Sequential Testing via Doubly Robust Q-learning

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline