ARXIV:2603.12572 · MEMORY RETRIEVAL · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

LMEB: Long-horizon Memory Embedding Benchmark

arXiv

LMEB is a benchmark framework designed to evaluate memory embeddings for complex long-horizon retrieval tasks.

Blocked on Code›Score7.0Evidence unverified

Opportunity summary

Pain LMEB is a benchmark framework designed to evaluate memory embeddings for complex long-horizon retrieval tasks.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

LMEB is a benchmark framework designed to evaluate memory embeddings for complex long-horizon retrieval tasks. To address this, we introduce the Long-horizon Memory Embedding Benchmark (LMEB), a comprehensive framework that evaluates embedding models' capabilities…

METHOD

Full abstract

Memory embeddings are crucial for memory-augmented systems, such as OpenClaw, but their evaluation is underexplored in current text embedding benchmarks, which narrowly focus on traditional passage retrieval and fail to assess models' ability to handle long-horizon memory retrieval tasks involving fragmented, context-dependent, and temporally distant information. To address this, we introduce the Long-horizon Memory Embedding Benchmark (LMEB), a comprehensive framework that evaluates embedding models' capabilities in handling complex, long-horizon memory retrieval tasks. LMEB spans 22 datasets and 193 zero-shot retrieval tasks across 4 memory types: episodic, dialogue, semantic, and procedural, with both AI-generated and human-annotated data. These memory types differ in terms of level of abstraction and temporal dependency, capturing distinct aspects of memory retrieval that reflect the diverse challenges of the real world. We evaluate 15 widely used embedding models, ranging from hundreds of millions to ten billion parameters. The results reveal that (1) LMEB provides a reasonable level of difficulty; (2) Larger models do not always perform better; (3) LMEB and MTEB exhibit orthogonality. This suggests that the field has yet to converge on a universal model capable of excelling across all memory retrieval tasks, and that performance in traditional passage retrieval may not generalize to long-horizon memory retrieval. In summary, by providing a standardized and reproducible evaluation framework, LMEB fills a crucial gap in memory embedding evaluation, driving further advancements in text embedding for handling long-term, context-dependent memory retrieval. LMEB is available at https://github.com/KaLM-Embedding/LMEB.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. The results reveal that (1) LMEB provides a reasonable level of difficulty; (2) Larger models do not always perform better; (3) LMEB and MTEB…

WHY NOW

Memory Retrieval moved forward this cycle; last verified April 2026. Public score 7.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainLMEB is a benchmark framework designed to evaluate memory embeddings for complex long-horizon retrieval tasks.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

LMEB is a benchmark framework designed to evaluate memory embeddings for complex long-horizon retrieval tasks.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

LMEB is a benchmark framework designed to evaluate memory embeddings for complex long-horizon retrieval tasks.

Segment

Memory Retrieval

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "ea7d8f56-ea42-4dbf-9ca1-6f78a90a77c9", "arxiv_id": "2603.12572", "canonical_route": "/paper/lmeb-long-horizon-memory-embedding-benchmark", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "lmeb-long-horizon-memory-embedding-benchmark", "endpoints": { "paper_pack": "/api/v1/paper/lmeb-long-horizon-memory-embedding-benchmark/paper-pack", "build_passport": "/api/v1/paper/lmeb-long-horizon-memory-embedding-benchmark/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "LMEB: Long-horizon Memory Embedding Benchmark", "normalized_query": "2603.12572", "route": "/paper/lmeb-long-horizon-memory-embedding-benchmark", "paper_ref": "lmeb-long-horizon-memory-embedding-benchmark", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/lmeb-long-horizon-memory-embedding-benchmark#webpage", "url": "https://sciencetostartup.com/paper/lmeb-long-horizon-memory-embedding-benchmark", "name": "LMEB: Long-horizon Memory Embedding Benchmark", "description": "LMEB is a benchmark framework designed to evaluate memory embeddings for complex long-horizon retrieval tasks.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/lmeb-long-horizon-memory-embedding-benchmark#scholarlyArticle", "headline": "LMEB: Long-horizon Memory Embedding Benchmark", "description": "LMEB is a benchmark framework designed to evaluate memory embeddings for complex long-horizon retrieval tasks.", "url": "https://sciencetostartup.com/paper/lmeb-long-horizon-memory-embedding-benchmark", "sameAs": "https://arxiv.org/abs/2603.12572", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.12572" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-13T02:09:57.000Z", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Memory Retrieval" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Memory Retrieval", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "LMEB: Long-horizon Memory Embedding Benchmark", "item": "https://sciencetostartup.com/paper/lmeb-long-horizon-memory-embedding-benchmark" } ] } ] }

Competitive landscape

LMEB is a benchmark framework designed to evaluate memory embeddings for complex long-horizon retrieval tasks.

Segment

Memory Retrieval

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

LMEB: Long-horizon Memory Embedding Benchmark

LMEB: Long-horizon Memory Embedding Benchmark

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline