ARXIV:2605.30911 · LVLM HALLUCINATION · SUBMITTED 01 JUN · 20:23 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

What Makes LVLMs Hallucinate Less? Unveiling the Architectural Factors Behind Hallucination Robustness

Yusheng He · Jizhe Zhou · Xia Du · Zheng Lin · Jun Luo · Jiancheng Lv · arXiv

CoSimUE is a benchmark and framework that links LVLM architectural design choices to specific hallucination types, providing guidance for building more reliable models.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain CoSimUE is a benchmark and framework that links LVLM architectural design choices to specific hallucination types, providing guidance for building more reliable models.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

CoSimUE is a benchmark and framework that links LVLM architectural design choices to specific hallucination types, providing guidance for building more reliable models. But what makes an LVLM hallucinate less?

METHOD

Hallucination remains one of the key challenges undermining the reliability of Large Vision-Language Models (LVLMs). But what makes an LVLM hallucinate less?

Full abstract

Hallucination remains one of the key challenges undermining the reliability of Large Vision-Language Models (LVLMs). But what makes an LVLM hallucinate less? Many existing efforts focus on improving internal components of the model. We argue that hallucination fundamentally stems from how the model architecture is designed. To investigate this, we factor the architecture design into three dimensions: Linguistic Foundation (LF), Visual Representation (VR), and Semantic Alignment (SA), and categorize hallucinations into Co-occurrence, Similarity, and previously overlooked Uncertainty types. Building on this formulation, we propose CoSimUE, a benchmark that creates fine-grained hallucination scenarios through controlled textual perturbations and random perturbations, enabling mapping between design choices and hallucination behaviors. Experiments across 7 design aspects show that: 1) the widely emphasized scaling of model parameters has only limited impact on reducing all three types of hallucinations; 2) larger and better-trained language foundations can reduce co-occurrence hallucinations; 3) stronger visual encoders and higher resolutions mitigate similarity errors; 4) effective alignment strategies alleviate uncertainty hallucinations. 5) Furthermore, cross-dimensional analysis reveals that jointly enhancing visual fidelity and alignment quality yields the most comprehensive improvements. This study provides the first systematic exploration linking architecture-level design to hallucination robustness, offering practical guidance for developing reliable and efficient LVLMs.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Experiments across 7 design aspects show that: 1) the widely emphasized scaling of model parameters has only limited impact on reducing all three types…

WHY NOW

LVLM Hallucination moved forward this cycle; last verified June 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainCoSimUE is a benchmark and framework that links LVLM architectural design choices to specific hallucination types, providing guidance for building more reliable models.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

CoSimUE is a benchmark and framework that links LVLM architectural design choices to specific hallucination types, providing guidance for building more reliable models.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

CoSimUE is a benchmark and framework that links LVLM architectural design choices to specific hallucination types, providing guidance for building more reliable models.

Segment

LVLM Hallucination

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "0856d627-81de-40b7-8131-5d6e265bec08", "arxiv_id": "2605.30911", "canonical_route": "/paper/what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness", "endpoints": { "paper_pack": "/api/v1/paper/what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness/paper-pack", "build_passport": "/api/v1/paper/what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "What Makes LVLMs Hallucinate Less? Unveiling the Architectural Factors Behind Hallucination Robustness", "normalized_query": "2605.30911", "route": "/paper/what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness", "paper_ref": "what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness#webpage", "url": "https://sciencetostartup.com/paper/what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness", "name": "What Makes LVLMs Hallucinate Less? Unveiling the Architectural Factors Behind Hallucination Robustness", "description": "CoSimUE is a benchmark and framework that links LVLM architectural design choices to specific hallucination types, providing guidance for building more reliable models.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness#scholarlyArticle", "headline": "What Makes LVLMs Hallucinate Less? Unveiling the Architectural Factors Behind Hallucination Robustness", "description": "CoSimUE is a benchmark and framework that links LVLM architectural design choices to specific hallucination types, providing guidance for building more reliable models.", "url": "https://sciencetostartup.com/paper/what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness", "sameAs": "https://arxiv.org/abs/2605.30911", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.30911" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-29T06:47:31.000Z", "author": [ { "@type": "Person", "name": "Yusheng He" }, { "@type": "Person", "name": "Jizhe Zhou" }, { "@type": "Person", "name": "Xia Du" }, { "@type": "Person", "name": "Zheng Lin" }, { "@type": "Person", "name": "Jun Luo" }, { "@type": "Person", "name": "Jiancheng Lv" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LVLM Hallucination" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LVLM Hallucination", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "What Makes LVLMs Hallucinate Less? Unveiling the Architectur", "item": "https://sciencetostartup.com/paper/what-makes-lvlms-hallucinate-less-unveiling-the-architectural-factors-behind-hallucination-robustness" } ] } ] }

Competitive landscape

CoSimUE is a benchmark and framework that links LVLM architectural design choices to specific hallucination types, providing guidance for building more reliable models.

Segment

LVLM Hallucination

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

What Makes LVLMs Hallucinate Less? Unveiling the Architectural Factors Behind Hallucination Robustness

What Makes LVLMs Hallucinate Less? Unveiling the Architectural Factors Behind Hallucination Robustness

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline