ARXIV:2604.01705 · MEDICAL AI · SUBMITTED 03 APR · 20:50 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Development and multi-center evaluation of domain-adapted speech recognition for human-AI teaming in real-world gastrointestinal endoscopy

Ruijie Yang · Yan Zhu · Peiyao Fu · Te Luo · Zhihua Wang · Xian Yang · +3 at arXiv

A domain-adapted speech recognition system for real-time human-AI collaboration in gastrointestinal endoscopy, significantly improving accuracy and enabling efficient edge deployment.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A domain-adapted speech recognition system for real-time human-AI collaboration in gastrointestinal endoscopy, significantly improving accuracy and enabling efficient edge deployment.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A domain-adapted speech recognition system for real-time human-AI collaboration in gastrointestinal endoscopy, significantly improving accuracy and enabling efficient edge deployment. Here, we present EndoASR, a domain-adapted ASR system designed for real-time deployment in endoscopic…

METHOD

Full abstract

Automatic speech recognition (ASR) is a critical interface for human-AI interaction in gastrointestinal endoscopy, yet its reliability in real-world clinical settings is limited by domain-specific terminology and complex acoustic conditions. Here, we present EndoASR, a domain-adapted ASR system designed for real-time deployment in endoscopic workflows. We develop a two-stage adaptation strategy based on synthetic endoscopy reports, targeting domain-specific language modeling and noise robustness. In retrospective evaluation across six endoscopists, EndoASR substantially improves both transcription accuracy and clinical usability, reducing character error rate (CER) from 20.52% to 14.14% and increasing medical term accuracy (Med ACC) from 54.30% to 87.59%. In a prospective multi-center study spanning five independent endoscopy centers, EndoASR demonstrates consistent generalization under heterogeneous real-world conditions. Compared with the baseline Paraformer model, CER is reduced from 16.20% to 14.97%, while Med ACC is improved from 61.63% to 84.16%, confirming its robustness in practical deployment scenarios. Notably, EndoASR achieves a real-time factor (RTF) of 0.005, significantly faster than Whisper-large-v3 (RTF 0.055), while maintaining a compact model size of 220M parameters, enabling efficient edge deployment. Furthermore, integration with large language models demonstrates that improved ASR quality directly enhances downstream structured information extraction and clinician-AI interaction. These results demonstrate that domain-adapted ASR can serve as a reliable interface for human-AI teaming in gastrointestinal endoscopy, with consistent performance validated across multi-center real-world clinical settings.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. In retrospective evaluation across six endoscopists, EndoASR substantially improves both transcription accuracy and clinical usability, reducing character error rate (CER) from 20.52% to 14.14%…

WHY NOW

Medical AI moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA domain-adapted speech recognition system for real-time human-AI collaboration in gastrointestinal endoscopy, significantly improving accuracy and enabling efficient edge deployment.

Evidence0 refs | 0 sources | 33% coverage

Blockerno shell-level blocker reported

Analysis summary

A domain-adapted speech recognition system for real-time human-AI collaboration in gastrointestinal endoscopy, significantly improving accuracy and enabling efficient edge deployment.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A domain-adapted speech recognition system for real-time human-AI collaboration in gastrointestinal endoscopy, significantly improving accuracy and enabling efficient edge deployment.

Segment

Medical AI

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "67890d4f-248d-46e2-8824-c9eeca9b8c50", "arxiv_id": "2604.01705", "canonical_route": "/paper/development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin", "endpoints": { "paper_pack": "/api/v1/paper/development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin/paper-pack", "build_passport": "/api/v1/paper/development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Development and multi-center evaluation of domain-adapted speech recognition for human-AI teaming in real-world gastrointestinal endoscopy", "normalized_query": "2604.01705", "route": "/paper/development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin", "paper_ref": "development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin#webpage", "url": "https://sciencetostartup.com/paper/development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin", "name": "Development and multi-center evaluation of domain-adapted speech recognition for human-AI teaming in real-world gastrointestinal endoscopy", "description": "A domain-adapted speech recognition system for real-time human-AI collaboration in gastrointestinal endoscopy, significantly improving accuracy and enabling efficient edge deployment.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin#scholarlyArticle", "headline": "Development and multi-center evaluation of domain-adapted speech recognition for human-AI teaming in real-world gastrointestinal endoscopy", "description": "A domain-adapted speech recognition system for real-time human-AI collaboration in gastrointestinal endoscopy, significantly improving accuracy and enabling efficient edge deployment.", "url": "https://sciencetostartup.com/paper/development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin", "sameAs": "https://arxiv.org/abs/2604.01705", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.01705" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-02T07:13:02.000Z", "author": [ { "@type": "Person", "name": "Ruijie Yang" }, { "@type": "Person", "name": "Yan Zhu" }, { "@type": "Person", "name": "Peiyao Fu" }, { "@type": "Person", "name": "Te Luo" }, { "@type": "Person", "name": "Zhihua Wang" }, { "@type": "Person", "name": "Xian Yang" }, { "@type": "Person", "name": "Quanlin Li" }, { "@type": "Person", "name": "Pinghong Zhou" }, { "@type": "Person", "name": "Shuo Wang" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Medical AI" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Medical AI", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Development and multi-center evaluation of domain-adapted sp", "item": "https://sciencetostartup.com/paper/development-and-multi-center-evaluation-of-domain-adapted-speech-recognition-for-human-ai-teaming-in-real-world-gastroin" } ] } ] }

Competitive landscape

A domain-adapted speech recognition system for real-time human-AI collaboration in gastrointestinal endoscopy, significantly improving accuracy and enabling efficient edge deployment.

Segment

Medical AI

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Development and multi-center evaluation of domain-adapted speech recognition for human-AI teaming in real-world gastrointestinal endoscopy

Development and multi-center evaluation of domain-adapted speech recognition for human-AI teaming in real-world gastrointestinal endoscopy

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline