ARXIV:2604.26419 · VISION-LANGUAGE MODELS · SUBMITTED 30 APR · 15:13 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Delineating Knowledge Boundaries for Honest Large Vision-Language Models

Junru Song · Yimeng Hu · Yijing Chen · Huining Li · Qian Li · Lizhen Cui · +1 at arXiv

Enhance large vision-language models to refuse queries beyond their knowledge, improving trustworthiness for specialized domains.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain Enhance large vision-language models to refuse queries beyond their knowledge, improving trustworthiness for specialized domains.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Enhance large vision-language models to refuse queries beyond their knowledge, improving trustworthiness for specialized domains. Moreover, current models exhibit a weak capacity to refuse queries that exceed their parametric knowledge.

METHOD

Full abstract

Large Vision-Language Models (VLMs) have achieved remarkable multimodal performance yet remain prone to factual hallucinations, particularly in long-tail or specialized domains. Moreover, current models exhibit a weak capacity to refuse queries that exceed their parametric knowledge. In this paper, we propose a systematic framework to enhance the refusal capability of VLMs when facing such unknown questions. We first curate a model-specific "Visual-Idk" (Visual-I don't know) dataset, leveraging multi-sample consistency probing to distinguish between known and unknown facts. We then align the model using supervised fine-tuning followed by preference-aware optimization (e.g., DPO, ORPO) to effectively delineate its knowledge boundaries. Results on the Visual-Idk dataset show our method improves the Truthful Rate from 57.9\% to 67.3\%. Additionally, internal probing also demonstrates that the model genuinely recognizes its boundaries instead of just memorizing refusal patterns. Our framework further generalizes to out-of-distribution medical and perceptual domains, providing a robust path toward more trustworthy and prudent visual assistants.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Results on the Visual-Idk dataset show our method improves the Truthful Rate from 57.9\% to 67.3\%. Code availability is flagged in the production record;…

WHY NOW

Vision-Language Models moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainEnhance large vision-language models to refuse queries beyond their knowledge, improving trustworthiness for specialized domains.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

Enhance large vision-language models to refuse queries beyond their knowledge, improving trustworthiness for specialized domains.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Enhance large vision-language models to refuse queries beyond their knowledge, improving trustworthiness for specialized domains.

Segment

Vision-Language Models

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "5c4247da-cec6-4215-8521-925cc1d20950", "arxiv_id": "2604.26419", "canonical_route": "/paper/delineating-knowledge-boundaries-for-honest-large-vision-language-models", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "delineating-knowledge-boundaries-for-honest-large-vision-language-models", "endpoints": { "paper_pack": "/api/v1/paper/delineating-knowledge-boundaries-for-honest-large-vision-language-models/paper-pack", "build_passport": "/api/v1/paper/delineating-knowledge-boundaries-for-honest-large-vision-language-models/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Delineating Knowledge Boundaries for Honest Large Vision-Language Models", "normalized_query": "2604.26419", "route": "/paper/delineating-knowledge-boundaries-for-honest-large-vision-language-models", "paper_ref": "delineating-knowledge-boundaries-for-honest-large-vision-language-models", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/delineating-knowledge-boundaries-for-honest-large-vision-language-models#webpage", "url": "https://sciencetostartup.com/paper/delineating-knowledge-boundaries-for-honest-large-vision-language-models", "name": "Delineating Knowledge Boundaries for Honest Large Vision-Language Models", "description": "Enhance large vision-language models to refuse queries beyond their knowledge, improving trustworthiness for specialized domains.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/delineating-knowledge-boundaries-for-honest-large-vision-language-models#scholarlyArticle", "headline": "Delineating Knowledge Boundaries for Honest Large Vision-Language Models", "description": "Enhance large vision-language models to refuse queries beyond their knowledge, improving trustworthiness for specialized domains.", "url": "https://sciencetostartup.com/paper/delineating-knowledge-boundaries-for-honest-large-vision-language-models", "sameAs": "https://arxiv.org/abs/2604.26419", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.26419" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-29T08:29:44.000Z", "author": [ { "@type": "Person", "name": "Junru Song" }, { "@type": "Person", "name": "Yimeng Hu" }, { "@type": "Person", "name": "Yijing Chen" }, { "@type": "Person", "name": "Huining Li" }, { "@type": "Person", "name": "Qian Li" }, { "@type": "Person", "name": "Lizhen Cui" }, { "@type": "Person", "name": "Yuntao Du" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Vision-Language Models" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Vision-Language Models", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Delineating Knowledge Boundaries for Honest Large Vision-Lan", "item": "https://sciencetostartup.com/paper/delineating-knowledge-boundaries-for-honest-large-vision-language-models" } ] } ] }

Competitive landscape

Enhance large vision-language models to refuse queries beyond their knowledge, improving trustworthiness for specialized domains.

Segment

Vision-Language Models

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Delineating Knowledge Boundaries for Honest Large Vision-Language Models

Delineating Knowledge Boundaries for Honest Large Vision-Language Models

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline