ARXIV:2605.09874 · EGOCENTRIC VIDEO UNDERSTANDING · SUBMITTED 12 MAY · 20:15 UTC · FRESHNESS FRESH

Verified Source: PDF linked · Verified PaperPack: citation fields available · Partial Proof: unverified proof status

EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding

Ziyang Wang · Yue Zhang · Shoubin Yu · Ce Zhang · Zengqi Zhao · Jaehong Yoon · Hyunji Lee · Gedas Bertasius · Mohit Bansal

A benchmark for week-long egocentric video understanding that tests memory-driven reasoning across entities, events, and behaviors, revealing current limitations in long-context multimodal systems.

Ship in 2-4 weeks · Score 7.0 · Evidence unverified

Opportunity summary

Pain A benchmark for week-long egocentric video understanding that tests memory-driven reasoning across entities, events, and behaviors, revealing current limitations in long-context multimodal systems.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified

PROBLEM

A benchmark for week-long egocentric video understanding that tests memory-driven reasoning across entities, events, and behaviors, revealing current limitations in long-context multimodal systems. In ultra-long video settings, relevant information is sparsely distributed across hours…

METHOD

Full abstract

Next-generation visual assistants, such as smart glasses, embodied agents, and always-on life-logging systems, must reason over an entire day or more of continuous visual experience. In ultra-long video settings, relevant information is sparsely distributed across hours or days, making memory a fundamental challenge: models must accumulate information over time, recall prior states, track temporal order, and abstract recurring patterns. However, existing week-long video benchmarks are primarily designed for perception and recognition, such as moment localization or global summarization, rather than reasoning that requires integrating evidence across multiple days. To address this gap, we introduce EgoMemReason, a comprehensive benchmark that systematically evaluates week-long egocentric video understanding through memory-driven reasoning. EgoMemReason evaluates three complementary memory types: entity memory, tracking how object states evolve and change across days; event memory, recalling and ordering activities separated by hours or days; and behavior memory, abstracting recurring patterns from sparse, repeated observations over the whole week period. EgoMemReason comprises 500 questions across three memory types and six core challenges, with an average of 5.1 video segments of evidence per question and 25.9 hours of memory backtracking. We evaluate EgoMemReason on 17 methods across MLLMs and agentic frameworks, revealing that even the best model achieves only 39.6% overall accuracy. Further analysis shows that the three memory types fail for distinct reasons and that performance degrades as evidence spans longer temporal horizons, revealing that long-horizon memory remains far from solved. We believe EgoMemReason establishes a strong foundation for evaluating and advancing long-context, memory-aware multimodal systems.
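The kind of scoring the abstract reports (overall accuracy, a breakdown by memory type, and a comparison across temporal horizons) can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: the record fields (`memory_type`, `backtrack_hours`, `prediction`, `answer`), the 24-hour bucket boundary, and the sample rows are all hypothetical and are not taken from the paper or its repository.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Record:
    # Hypothetical schema; the benchmark's real file format may differ.
    memory_type: str        # "entity", "event", or "behavior"
    backtrack_hours: float  # how far back in time the evidence lies
    prediction: str         # model output for the question
    answer: str             # ground-truth answer


def accuracy_by(records, key):
    """Return {group: fraction correct} for groups produced by key(record)."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for r in records:
        group = key(r)
        total[group] += 1
        correct[group] += int(r.prediction == r.answer)
    return {g: correct[g] / total[g] for g in total}


def horizon_bucket(record, boundary_hours=24.0):
    # Assumed split point, chosen only for illustration.
    return "short" if record.backtrack_hours < boundary_hours else "long"


# Made-up rows for illustration only; these are not benchmark data.
records = [
    Record("entity", 3.0, "A", "A"),
    Record("entity", 40.0, "B", "C"),
    Record("event", 12.0, "D", "D"),
    Record("behavior", 90.0, "A", "B"),
]

overall = accuracy_by(records, lambda r: "all")["all"]
per_type = accuracy_by(records, lambda r: r.memory_type)
per_horizon = accuracy_by(records, horizon_bucket)
```

Grouping through a single `accuracy_by` helper keeps the overall, per-type, and per-horizon numbers on the same definition of "correct", so the breakdowns stay comparable.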

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. We evaluate EgoMemReason on 17 methods across MLLMs and agentic frameworks, revealing that even the best model achieves only 39.6% overall accuracy. A public…

WHY NOW

Egocentric Video Understanding moved forward this cycle; last verified May 2026. Public score 7.0/10. Implementation evidence is present through a linked repository.

Competitive landscape

A benchmark for week-long egocentric video understanding that tests memory-driven reasoning across entities, events, and behaviors, revealing current limitations in long-context multimodal systems.

Segment: Egocentric Video Understanding

Adoption evidence: Public code linked for build inspection

Commercial read: 7.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified

arXiv: https://arxiv.org/abs/2605.09874

Code repository: https://github.com/Ziyang412/EgoMemReason

Date published: 2026-05-11
