ARXIV:2604.01841 · CLINICAL AI · SUBMITTED 03 APR · 20:50 UTC · FRESHNESS STALE

Verified · Source: PDF linked | Verified · PaperPack: citation fields available | Partial · Proof: unverified proof status

Retrieval-aligned Tabular Foundation Models Enable Robust Clinical Risk Prediction in Electronic Health Records Under Real-world Constraints

Minh-Khoi Pham · Thang-Long Nguyen Ho · Thao Thi Phuong Dao · Tai Tan Mai · Minh-Triet Tran · Marie E. Ward · +5 at arXiv

A retrieval-aligned framework for robust clinical risk prediction from electronic health records that overcomes data complexity and imbalance.

Ship in 2-4 weeks · Score 7.0 · Evidence unverified

Opportunity summary

Pain A retrieval-aligned framework for robust clinical risk prediction from electronic health records that overcomes data complexity and imbalance.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified


PROBLEM

A retrieval-aligned framework for robust clinical risk prediction from electronic health records that overcomes data complexity and imbalance. While tabular in-context learning (TICL) and retrieval-augmented methods perform well on generic benchmarks, their behavior in…

METHOD

Full abstract

Clinical prediction from structured electronic health records (EHRs) is challenging due to high dimensionality, heterogeneity, class imbalance, and distribution shift. While tabular in-context learning (TICL) and retrieval-augmented methods perform well on generic benchmarks, their behavior in clinical settings remains unclear. We present a multi-cohort EHR benchmark comparing classical, deep tabular, and TICL models across varying data scale, feature dimensionality, outcome rarity, and cross-cohort generalization. PFN-based TICL models are sample-efficient in low-data regimes but degrade under naive distance-based retrieval as heterogeneity and imbalance increase. We propose AWARE, a task-aligned retrieval framework using supervised embedding learning and lightweight adapters. AWARE improves AUPRC by up to 12.2% under extreme imbalance, with gains increasing with data complexity. Our results identify retrieval quality and retrieval-inference alignment as key bottlenecks for deploying tabular in-context learning in clinical prediction.
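The abstract contrasts naive distance-based retrieval with task-aligned retrieval. The sketch below illustrates that contrast on a made-up toy cohort, using only the Python standard library. It is not the AWARE method: the paper describes supervised embedding learning and lightweight adapters, whereas this sketch swaps in a much cruder stand-in (rescaling each feature by how strongly it separates the two classes). The function names `class_gap_scales` and `retrieve`, and all data values, are hypothetical.

```python
import math


def class_gap_scales(X, y):
    """Per-feature scale: gap between class means divided by feature variance.

    A crude, label-aware stand-in for a learned embedding (assumption for
    illustration only; this is NOT the AWARE method from the paper).
    """
    scales = []
    for j in range(len(X[0])):
        col = [row[j] for row in X]
        pos = [v for v, label in zip(col, y) if label == 1]
        neg = [v for v, label in zip(col, y) if label == 0]
        mean = sum(col) / len(col)
        var = sum((v - mean) ** 2 for v in col) / len(col)
        gap = abs(sum(pos) / len(pos) - sum(neg) / len(neg))
        # Constant features carry no signal, so give them zero weight.
        scales.append(gap / var if var > 0 else 0.0)
    return scales


def retrieve(X, query, k, scales=None):
    """Return indices of the k rows closest to `query`.

    With scales=None this is plain Euclidean (naive) retrieval.
    """
    if scales is None:
        scales = [1.0] * len(query)
    dists = [
        math.sqrt(sum((s * (a - b)) ** 2 for s, a, b in zip(scales, row, query)))
        for row in X
    ]
    return sorted(range(len(X)), key=dists.__getitem__)[:k]


# Hypothetical toy cohort: feature 0 is informative, feature 1 is
# large-scale noise. Two positives among eight rows (imbalanced).
X = [
    [0.90, 10.0], [0.80, 90.0],                 # positives
    [0.10, 50.0], [0.20, 52.0], [0.15, 48.0],   # negatives
    [0.10, 20.0], [0.20, 80.0], [0.15, 60.0],   # negatives
]
y = [1, 1, 0, 0, 0, 0, 0, 0]
query = [0.85, 50.0]  # resembles the positives on the informative feature

naive_idx = retrieve(X, query, k=2)
aligned_idx = retrieve(X, query, k=2, scales=class_gap_scales(X, y))

print("naive context labels:  ", [y[i] for i in naive_idx])
print("aligned context labels:", [y[i] for i in aligned_idx])
```

On this toy data the unweighted retrieval returns only negative rows, because the large-scale second feature dominates the distance, while the label-aware rescaling returns the two positive rows. A context built from the first set would give an in-context model no positive examples to condition on.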

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. AWARE improves AUPRC by up to 12.2% under extreme imbalance, with gains increasing with data complexity. Code availability is flagged in the production record;…
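AUPRC is the metric quoted above. As a reminder of what it measures, the following sketch computes average precision, a common step-wise estimator of the area under the precision-recall curve, on hypothetical labels and scores. The source does not say how the paper computes AUPRC or whether the 12.2% figure is absolute or relative, so this is only an illustration; the function name `average_precision` and the data are assumptions.

```python
def average_precision(y_true, scores):
    """Average precision: mean of precision@rank taken at each positive.

    Ties in `scores` are broken by input order (a simplification).
    """
    n_pos = sum(y_true)
    if n_pos == 0:
        raise ValueError("average precision is undefined without positives")
    # Rank examples from highest to lowest score.
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    hits = 0
    total = 0.0
    for rank, i in enumerate(order, start=1):
        if y_true[i] == 1:
            hits += 1
            total += hits / rank  # precision at this recall step
    return total / n_pos


# Hypothetical imbalanced example: 2 positives out of 10 (prevalence 0.2).
y_true = [1, 0, 0, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]

ap = average_precision(y_true, scores)
prevalence = sum(y_true) / len(y_true)
print(f"average precision = {ap:.3f}, prevalence = {prevalence:.3f}")
```

A ranker with no skill has an expected average precision close to the positive prevalence (0.2 in this toy example), which is why AUPRC values are hard to compare across outcomes of different rarity without also stating the prevalence.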

WHY NOW

Clinical AI moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.


Opportunity summary

Score 7.0

Pain A retrieval-aligned framework for robust clinical risk prediction from electronic health records that overcomes data complexity and imbalance.

Evidence 0 refs | 0 sources | 33% coverage

Blocker no shell-level blocker reported

Analysis summary

A retrieval-aligned framework for robust clinical risk prediction from electronic health records that overcomes data complexity and imbalance.


Competitive landscape

A retrieval-aligned framework for robust clinical risk prediction from electronic health records that overcomes data complexity and imbalance.

Segment: Clinical AI

Adoption evidence: No public code link in the paper record yet

Commercial read: 7.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified


Paper record

arXiv: 2604.01841 (https://arxiv.org/abs/2604.01841)

Published: 2026-04-02T09:56:17.000Z

Authors: Minh-Khoi Pham, Thang-Long Nguyen Ho, Thao Thi Phuong Dao, Tai Tan Mai, Minh-Triet Tran, Marie E. Ward, Una Geary, Rob Brennan, Nick McDonald, Martin Crane, Marija Bezbradica

Page: https://sciencetostartup.com/paper/retrieval-aligned-tabular-foundation-models-enable-robust-clinical-risk-prediction-in-electronic-health-records-under-re

Viability score: 7 | Research domain: Clinical AI | Commercial readiness: code | Free to access: yes

