ARXIV:2603.02540 · LLM EVALUATION · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities

arXiv

Introduce NeuroCognition benchmark for evaluating LLM cognitive abilities distinct from existing benchmarks.

Blocked on Code›Score5.0Evidence unverified

Opportunity summary

Pain Introduce NeuroCognition benchmark for evaluating LLM cognitive abilities distinct from existing benchmarks.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Introduce NeuroCognition benchmark for evaluating LLM cognitive abilities distinct from existing benchmarks. This is because current benchmarks focus on task completion, failing to probe the foundational cognitive abilities that highlight these behaviors.

METHOD

Full abstract

Large language models (LLMs) exhibit a unified "general factor" of capability across 10 benchmarks, a finding confirmed by our factor analysis of 156 models, yet they still struggle with simple, trivial tasks for humans. This is because current benchmarks focus on task completion, failing to probe the foundational cognitive abilities that highlight these behaviors. We address this by introducing the NeuroCognition benchmark, grounded in three adapted neuropsychological tests: Raven's Progressive Matrices (abstract relational reasoning), Spatial Working Memory (maintenance and systematic search), and the Wisconsin Card Sorting Test (cognitive flexibility). Our evaluation reveals that while models perform strongly on text, their performance degrades for images and with increased complexity. Furthermore, we observe that complex reasoning is not universally beneficial, whereas simple, human-like strategies yield partial gains. We also find that NeuroCognition correlates positively with standard general-capability benchmarks, while still measuring distinct cognitive abilities beyond them. Overall, NeuroCognition emphasizes where current LLMs align with human-like intelligence and where they lack core adaptive cognition, showing the potential to serve as a verifiable, scalable source for improving LLMs.

RESULT

ScienceToStartup currently rates this 5.0/10 on the public viability pass. Overall, NeuroCognition emphasizes where current LLMs align with human-like intelligence and where they lack core adaptive cognition, showing the potential to serve as a…

WHY NOW

LLM Evaluation moved forward this cycle; last verified April 2026. Public score 5.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score5.0

PainIntroduce NeuroCognition benchmark for evaluating LLM cognitive abilities distinct from existing benchmarks.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

Introduce NeuroCognition benchmark for evaluating LLM cognitive abilities distinct from existing benchmarks.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

Introduce NeuroCognition benchmark for evaluating LLM cognitive abilities distinct from existing benchmarks.

Segment

LLM Evaluation

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "8926bfe7-18aa-43fb-b134-646d0a497a1f", "arxiv_id": "2603.02540", "canonical_route": "/paper/a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities", "endpoints": { "paper_pack": "/api/v1/paper/a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities/paper-pack", "build_passport": "/api/v1/paper/a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities", "normalized_query": "2603.02540", "route": "/paper/a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities", "paper_ref": "a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities#webpage", "url": "https://sciencetostartup.com/paper/a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities", "name": "A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities", "description": "Introduce NeuroCognition benchmark for evaluating LLM cognitive abilities distinct from existing benchmarks.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities#scholarlyArticle", "headline": "A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities", "description": "Introduce NeuroCognition benchmark for evaluating LLM cognitive abilities distinct from existing benchmarks.", "url": "https://sciencetostartup.com/paper/a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities", "sameAs": "https://arxiv.org/abs/2603.02540", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.02540" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-03T02:54:58.000Z", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 5 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Evaluation" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Evaluation", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "A Neuropsychologically Grounded Evaluation of LLM Cognitive ", "item": "https://sciencetostartup.com/paper/a-neuropsychologically-grounded-evaluation-of-llm-cognitive-abilities" } ] } ] }

Competitive landscape

Introduce NeuroCognition benchmark for evaluating LLM cognitive abilities distinct from existing benchmarks.

Segment

LLM Evaluation

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities

A Neuropsychologically Grounded Evaluation of LLM Cognitive Abilities

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline