ARXIV:2603.28130 · DOCUMENT PARSING · SUBMITTED 31 MAR · 20:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

Zhang Li · Zhibo Lin · Qiang Liu · Ziyang Zhang · Shuo Zhang · Zidun Guo · +4 at arXiv

A benchmark and evaluation of multilingual document parsing models reveals significant performance gaps, highlighting opportunities for more inclusive and deployment-ready parsing systems.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A benchmark and evaluation of multilingual document parsing models reveals significant performance gaps, highlighting opportunities for more inclusive and deployment-ready parsing systems.

Evidence 67 refs | 4 sources | 83% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A benchmark and evaluation of multilingual document parsing models reveals significant performance gaps, highlighting opportunities for more inclusive and deployment-ready parsing systems. Document parsing has made remarkable strides, yet almost exclusively on clean, digital,…

METHOD

Full abstract

We introduce Multilingual Document Parsing Benchmark, the first benchmark for multilingual digital and photographed document parsing. Document parsing has made remarkable strides, yet almost exclusively on clean, digital, well-formatted pages in a handful of dominant languages. No systematic benchmark exists to evaluate how models perform on digital and photographed documents across diverse scripts and low-resource languages. MDPBench comprises 3,400 document images spanning 17 languages, diverse scripts, and varied photographic conditions, with high-quality annotations produced through a rigorous pipeline of expert model labeling, manual correction, and human verification. To ensure fair comparison and prevent data leakage, we maintain separate public and private evaluation splits. Our comprehensive evaluation of both open-source and closed-source models uncovers a striking finding: while closed-source models (notably Gemini3-Pro) prove relatively robust, open-source alternatives suffer dramatic performance collapse, particularly on non-Latin scripts and real-world photographed documents, with an average drop of 17.8% on photographed documents and 14.0% on non-Latin scripts. These results reveal significant performance imbalances across languages and conditions, and point to concrete directions for building more inclusive, deployment-ready parsing systems. Source available at https://github.com/Yuliang-Liu/MultimodalOCR.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. These results reveal significant performance imbalances across languages and conditions, and point to concrete directions for building more inclusive, deployment-ready parsing systems. A public…

WHY NOW

Document Parsing moved forward this cycle; last verified April 2026. Public score 7.0/10. Implementation evidence is present through a linked repository.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA benchmark and evaluation of multilingual document parsing models reveals significant performance gaps, highlighting opportunities for more inclusive and deployment-ready parsing systems.

Evidence67 refs | 4 sources | 83% coverage

Blockerno shell-level blocker reported

Analysis summary

A benchmark and evaluation of multilingual document parsing models reveals significant performance gaps, highlighting opportunities for more inclusive and deployment-ready parsing systems.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A benchmark and evaluation of multilingual document parsing models reveals significant performance gaps, highlighting opportunities for more inclusive and deployment-ready parsing systems.

Segment

Document Parsing

Adoption evidence

Public code linked for build inspection

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "244dfc17-81d5-4869-b932-158b8f99b0cc", "arxiv_id": "2603.28130", "canonical_route": "/paper/mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios", "endpoints": { "paper_pack": "/api/v1/paper/mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios/paper-pack", "build_passport": "/api/v1/paper/mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios", "normalized_query": "2603.28130", "route": "/paper/mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios", "paper_ref": "mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios#webpage", "url": "https://sciencetostartup.com/paper/mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios", "name": "MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios", "description": "A benchmark and evaluation of multilingual document parsing models reveals significant performance gaps, highlighting opportunities for more inclusive and deployment-ready parsing systems.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios#scholarlyArticle", "headline": "MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios", "description": "A benchmark and evaluation of multilingual document parsing models reveals significant performance gaps, highlighting opportunities for more inclusive and deployment-ready parsing systems.", "url": "https://sciencetostartup.com/paper/mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios", "sameAs": "https://arxiv.org/abs/2603.28130", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.28130" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-30T07:47:46.000Z", "author": [ { "@type": "Person", "name": "Zhang Li" }, { "@type": "Person", "name": "Zhibo Lin" }, { "@type": "Person", "name": "Qiang Liu" }, { "@type": "Person", "name": "Ziyang Zhang" }, { "@type": "Person", "name": "Shuo Zhang" }, { "@type": "Person", "name": "Zidun Guo" }, { "@type": "Person", "name": "Jiajun Song" }, { "@type": "Person", "name": "Jiarui Zhang" }, { "@type": "Person", "name": "Xiang Bai" }, { "@type": "Person", "name": "Yuliang Liu" } ], "codeRepository": "https://github.com/Yuliang-Liu/MultimodalOCR", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Document Parsing" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code, repo url" } ] }, { "@type": "SoftwareSourceCode", "@id": "https://sciencetostartup.com/paper/mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios#software", "name": "MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios - Source Code", "description": "A benchmark and evaluation of multilingual document parsing models reveals significant performance gaps, highlighting opportunities for more inclusive and deployment-ready parsing systems.", "codeRepository": "https://github.com/Yuliang-Liu/MultimodalOCR", "url": "https://github.com/Yuliang-Liu/MultimodalOCR" }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Document Parsing", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "MDPBench: A Benchmark for Multilingual Document Parsing in R", "item": "https://sciencetostartup.com/paper/mdpbench-a-benchmark-for-multilingual-document-parsing-in-real-world-scenarios" } ] } ] }

Competitive landscape

A benchmark and evaluation of multilingual document parsing models reveals significant performance gaps, highlighting opportunities for more inclusive and deployment-ready parsing systems.

Segment

Document Parsing

Adoption evidence

Public code linked for build inspection

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

MDPBench: A Benchmark for Multilingual Document Parsing in Real-World Scenarios

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline