ARXIV:2604.01764 · COGNITIVE VISUAL REASONING · SUBMITTED 03 APR · 20:50 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Hidden Meanings in Plain Sight: RebusBench for Evaluating Cognitive Visual Reasoning

Seyed Amir Kasaei · Arash Marioriyad · Mahbod Khaleti · MohammadAmin Fazli · Mahdieh Soleymani Baghshah · Mohammad Hossein Rohban · arXiv

A new benchmark and evaluation framework for visual reasoning tasks that current state-of-the-art models fail at, highlighting a critical gap in cognitive integration.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A new benchmark and evaluation framework for visual reasoning tasks that current state-of-the-art models fail at, highlighting a critical gap in cognitive integration.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A new benchmark and evaluation framework for visual reasoning tasks that current state-of-the-art models fail at, highlighting a critical gap in cognitive integration. However, a critical cognitive gap emerges when the visual input serves…

METHOD

Full abstract

Large Vision-Language Models (LVLMs) have achieved remarkable proficiency in explicit visual recognition, effectively describing what is directly visible in an image. However, a critical cognitive gap emerges when the visual input serves only as a clue rather than the answer. We identify that current models struggle with the complex, multi-step reasoning required to solve problems where information is not explicitly depicted. Successfully solving a rebus puzzle requires a distinct cognitive workflow: the model must extract visual and textual attributes, retrieve linguistic prior knowledge (such as idioms), and perform abstract mapping to synthesize these elements into a meaning that exists outside the pixel space. To evaluate this neurosymbolic capability, we introduce RebusBench, a benchmark of 1,164 puzzles designed to test this specific integration of perception and knowledge. Our evaluation of state-of-the-art models (including Qwen, InternVL, and LLaVA) shows a severe deficiency: performance saturates below 10% Exact Match and 20% semantic accuracy, with no significant improvement observed from model scaling or In-Context Learning (ICL). These findings suggest that while models possess the necessary visual and linguistic components, they lack the cognitive reasoning glue to connect them. Project page available at https://amirkasaei.com/rebusbench/.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Our evaluation of state-of-the-art models (including Qwen, InternVL, and LLaVA) shows a severe deficiency: performance saturates below 10% Exact Match and 20% semantic accuracy,…

WHY NOW

Cognitive Visual Reasoning moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA new benchmark and evaluation framework for visual reasoning tasks that current state-of-the-art models fail at, highlighting a critical gap in cognitive integration.

Evidence0 refs | 0 sources | 33% coverage

Blockerno shell-level blocker reported

Analysis summary

A new benchmark and evaluation framework for visual reasoning tasks that current state-of-the-art models fail at, highlighting a critical gap in cognitive integration.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Hidden Meanings in Plain Sight: RebusBench for Evaluating Cognitive Visual Reasoning

Seyed Amir Kasaei · Arash Marioriyad · Mahbod Khaleti · MohammadAmin Fazli · Mahdieh Soleymani Baghshah · Mohammad Hossein Rohban · arXiv

A new benchmark and evaluation framework for visual reasoning tasks that current state-of-the-art models fail at, highlighting a critical gap in cognitive integration.

Competitive landscape

A new benchmark and evaluation framework for visual reasoning tasks that current state-of-the-art models fail at, highlighting a critical gap in cognitive integration.

Segment

Cognitive Visual Reasoning

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "32792b5b-d9fb-4863-9df8-41cb63790ab3", "arxiv_id": "2604.01764", "canonical_route": "/paper/hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning", "endpoints": { "paper_pack": "/api/v1/paper/hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning/paper-pack", "build_passport": "/api/v1/paper/hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Hidden Meanings in Plain Sight: RebusBench for Evaluating Cognitive Visual Reasoning", "normalized_query": "2604.01764", "route": "/paper/hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning", "paper_ref": "hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning#webpage", "url": "https://sciencetostartup.com/paper/hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning", "name": "Hidden Meanings in Plain Sight: RebusBench for Evaluating Cognitive Visual Reasoning", "description": "A new benchmark and evaluation framework for visual reasoning tasks that current state-of-the-art models fail at, highlighting a critical gap in cognitive integration.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning#scholarlyArticle", "headline": "Hidden Meanings in Plain Sight: RebusBench for Evaluating Cognitive Visual Reasoning", "description": "A new benchmark and evaluation framework for visual reasoning tasks that current state-of-the-art models fail at, highlighting a critical gap in cognitive integration.", "url": "https://sciencetostartup.com/paper/hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning", "sameAs": "https://arxiv.org/abs/2604.01764", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.01764" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-02T08:33:13.000Z", "author": [ { "@type": "Person", "name": "Seyed Amir Kasaei" }, { "@type": "Person", "name": "Arash Marioriyad" }, { "@type": "Person", "name": "Mahbod Khaleti" }, { "@type": "Person", "name": "MohammadAmin Fazli" }, { "@type": "Person", "name": "Mahdieh Soleymani Baghshah" }, { "@type": "Person", "name": "Mohammad Hossein Rohban" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Cognitive Visual Reasoning" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Cognitive Visual Reasoning", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Hidden Meanings in Plain Sight: RebusBench for Evaluating Co", "item": "https://sciencetostartup.com/paper/hidden-meanings-in-plain-sight-rebusbench-for-evaluating-cognitive-visual-reasoning" } ] } ] }

Competitive landscape

A new benchmark and evaluation framework for visual reasoning tasks that current state-of-the-art models fail at, highlighting a critical gap in cognitive integration.

Segment

Cognitive Visual Reasoning

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Hidden Meanings in Plain Sight: RebusBench for Evaluating Cognitive Visual Reasoning

Hidden Meanings in Plain Sight: RebusBench for Evaluating Cognitive Visual Reasoning

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline