ARXIV:2605.03103 · MEDICAL AI · SUBMITTED 06 MAY · 20:25 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned QA and Semi-Structured Extraction from OCR Clinical Reports

Yingyun Li · Yu Wang · Haiyang Qian · arXiv

Introducing MedStruct-S, a benchmark for evaluating information extraction from noisy clinical reports, enabling robust AI for patient history reconstruction.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain Introducing MedStruct-S, a benchmark for evaluating information extraction from noisy clinical reports, enabling robust AI for patient history reconstruction.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Introducing MedStruct-S, a benchmark for evaluating information extraction from noisy clinical reports, enabling robust AI for patient history reconstruction. In practice, this scenario commonly involves three tasks: (i) field-header (key) discovery, (ii) key-conditioned question…

METHOD

Full abstract

Semi-structured information extraction (IE) from OCR-derived clinical reports is crucial for efficiently reconstructing patients' longitudinal medical histories. In practice, this scenario commonly involves three tasks: (i) field-header (key) discovery, (ii) key-conditioned question answering (QA), and (iii) end-to-end key-value pair extraction. However, existing evaluations often under-model two factors: heterogeneous and incompletely known key representations, and OCR-induced noise. This makes it difficult to assess model robustness in real-world settings. We present MedStruct-S, a benchmark specifically designed to evaluate these tasks under unknown keys and OCR noise. MedStruct-S contains 3,582 annotated real-world clinical report pages. Using MedStruct-S, we benchmark two representative paradigms: encoder-only sequence labeling with post-processing and decoder-only structured generation, covering four encoder-only and five decoder-only models spanning 0.11B to 103B parameters. Our results show that encoder-only models achieve the best performance for non-null-value key-conditioned QA despite being substantially smaller than decoder-only models. When comparing models of similar order of magnitude, encoder-only models still perform better overall. Without controlling for model scale, fine-tuned decoder-only models deliver the strongest overall results. These findings show that the benchmark provides a reliable and practical basis for selecting and comparing models across different semi-structured IE settings.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Our results show that encoder-only models achieve the best performance for non-null-value key-conditioned QA despite being substantially smaller than decoder-only models. Code availability is…

WHY NOW

Medical AI moved forward this cycle; last verified May 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainIntroducing MedStruct-S, a benchmark for evaluating information extraction from noisy clinical reports, enabling robust AI for patient history reconstruction.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

Introducing MedStruct-S, a benchmark for evaluating information extraction from noisy clinical reports, enabling robust AI for patient history reconstruction.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Introducing MedStruct-S, a benchmark for evaluating information extraction from noisy clinical reports, enabling robust AI for patient history reconstruction.

Segment

Medical AI

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "df368dd3-2eab-45a6-b13b-811cc81e4786", "arxiv_id": "2605.03103", "canonical_route": "/paper/medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports", "endpoints": { "paper_pack": "/api/v1/paper/medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports/paper-pack", "build_passport": "/api/v1/paper/medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned QA and Semi-Structured Extraction from OCR Clinical Reports", "normalized_query": "2605.03103", "route": "/paper/medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports", "paper_ref": "medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports#webpage", "url": "https://sciencetostartup.com/paper/medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports", "name": "MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned QA and Semi-Structured Extraction from OCR Clinical Reports", "description": "Introducing MedStruct-S, a benchmark for evaluating information extraction from noisy clinical reports, enabling robust AI for patient history reconstruction.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports#scholarlyArticle", "headline": "MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned QA and Semi-Structured Extraction from OCR Clinical Reports", "description": "Introducing MedStruct-S, a benchmark for evaluating information extraction from noisy clinical reports, enabling robust AI for patient history reconstruction.", "url": "https://sciencetostartup.com/paper/medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports", "sameAs": "https://arxiv.org/abs/2605.03103", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.03103" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-04T19:37:21.000Z", "author": [ { "@type": "Person", "name": "Yingyun Li" }, { "@type": "Person", "name": "Yu Wang" }, { "@type": "Person", "name": "Haiyang Qian" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Medical AI" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Medical AI", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned ", "item": "https://sciencetostartup.com/paper/medstruct-s-a-benchmark-for-key-discovery-key-conditioned-qa-and-semi-structured-extraction-from-ocr-clinical-reports" } ] } ] }

Competitive landscape

Introducing MedStruct-S, a benchmark for evaluating information extraction from noisy clinical reports, enabling robust AI for patient history reconstruction.

Segment

Medical AI

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned QA and Semi-Structured Extraction from OCR Clinical Reports

MedStruct-S: A Benchmark for Key Discovery, Key-Conditioned QA and Semi-Structured Extraction from OCR Clinical Reports

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline