ARXIV:2604.07017 · AI ASSISTANTS · SUBMITTED 10 APR · 00:14 UTC · FRESHNESS STALE

Verified Source: PDF linked · Verified PaperPack: citation fields available · Partial Proof: unverified proof status

A-MBER: Affective Memory Benchmark for Emotion Recognition

Deliang Wen · Ke Sun · Yu Wang · arXiv

A benchmark for AI assistants to understand user emotions based on past interactions, enabling more personalized responses.

Ship in 2-4 weeks · Score 7.0 · Evidence unverified

Opportunity summary

Pain: A benchmark for AI assistants to understand user emotions based on past interactions, enabling more personalized responses.

Evidence: 17 refs | 3 sources | 67% coverage

Blocker: Evidence unverified

PROBLEM

AI assistants that interact with users over time need to interpret the user's current emotional state in order to respond appropriately and personally. However, this capability remains insufficiently evaluated.

METHOD

Full abstract

AI assistants that interact with users over time need to interpret the user's current emotional state in order to respond appropriately and personally. However, this capability remains insufficiently evaluated. Existing emotion datasets mainly assess local or instantaneous affect, while long-term memory benchmarks focus largely on factual recall, temporal consistency, or knowledge updating. As a result, current resources provide limited support for testing whether a model can use remembered interaction history to interpret a user's present affective state. We introduce A-MBER, an Affective Memory Benchmark for Emotion Recognition, to evaluate this capability. A-MBER focuses on present affective interpretation grounded in remembered multi-session interaction history. Given an interaction trajectory and a designated anchor turn, a model must infer the user's current affective state, identify historically relevant evidence, and justify its interpretation in a grounded way. The benchmark is constructed through a staged pipeline with explicit intermediate representations, including long-horizon planning, conversation generation, annotation, question construction, and final packaging. It supports judgment, retrieval, and explanation tasks, together with robustness settings such as modality degradation and insufficient-evidence conditions. Experiments compare local-context, long-context, retrieved-memory, structured-memory, and gold-evidence conditions within a unified framework. Results show that A-MBER is especially discriminative on the subsets it is designed to stress, including long-range implicit affect, high-dependency memory levels, trajectory-based reasoning, and adversarial settings. These findings suggest that memory supports affective interpretation not simply by providing more history, but by enabling more selective, grounded, and context-sensitive use of past interaction.
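The task setup in the abstract (an interaction trajectory, a designated anchor turn, and several context conditions) can be made concrete with a small sketch. Everything below is an assumption for illustration: the `Turn` and `Item` classes, the `build_context` and `evidence_recall` helpers, the toy dialogue, and the word-overlap retriever are hypothetical and are not taken from the paper or its data format. The structured-memory condition is left out because the abstract does not describe how that memory is built.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Turn:
    index: int     # position in the whole trajectory (assumed equal to the list position)
    session: int   # which session the turn belongs to
    speaker: str
    text: str


@dataclass
class Item:
    trajectory: List[Turn]
    anchor: int                # index of the designated anchor turn
    gold_evidence: List[int]   # indices of the historically relevant turns


def words(text: str) -> set:
    """Lowercased whitespace tokens; a crude stand-in for a real retriever's features."""
    return set(text.lower().split())


def build_context(item: Item, condition: str, k: int = 1) -> List[Turn]:
    """Return the turns a model would see under one (hypothetical) context condition."""
    anchor = item.trajectory[item.anchor]
    history = item.trajectory[:item.anchor]
    if condition == "local":
        # Only earlier turns from the anchor's own session.
        kept = [t for t in history if t.session == anchor.session]
    elif condition == "long":
        # The full history up to the anchor.
        kept = list(history)
    elif condition == "retrieved":
        # Top-k earlier turns by word overlap with the anchor, restored to dialogue order.
        ranked = sorted(history,
                        key=lambda t: len(words(t.text) & words(anchor.text)),
                        reverse=True)
        kept = sorted(ranked[:k], key=lambda t: t.index)
    elif condition == "gold":
        # Only the annotated evidence turns.
        kept = [item.trajectory[i] for i in sorted(item.gold_evidence)]
    else:
        raise ValueError(f"unknown condition: {condition}")
    return kept + [anchor]


def evidence_recall(context: List[Turn], item: Item) -> float:
    """Fraction of gold evidence turns that appear in the context."""
    gold = set(item.gold_evidence)
    seen = {t.index for t in context}
    return len(gold & seen) / len(gold) if gold else 1.0


# Toy three-session trajectory (invented for this sketch).
item = Item(
    trajectory=[
        Turn(0, 0, "user", "My manager cancelled my promotion review again."),
        Turn(1, 0, "assistant", "That sounds frustrating."),
        Turn(2, 1, "user", "Can you suggest a recipe for dinner?"),
        Turn(3, 1, "assistant", "Sure, how about pasta?"),
        Turn(4, 2, "user", "Great, the review got moved again."),
    ],
    anchor=4,
    gold_evidence=[0],
)

for condition in ("local", "long", "retrieved", "gold"):
    ctx = build_context(item, condition)
    print(condition, [t.index for t in ctx], evidence_recall(ctx, item))
```

In this toy item the anchor turn reads as positive on its surface, and the local condition never sees the earlier session that would mark it as sarcastic. This is the kind of gap between local and memory-backed conditions that the abstract says the benchmark is designed to stress.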

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. According to the abstract, A-MBER is especially discriminative on the subsets it is designed to stress, including long-range implicit affect, high-dependency memory levels, trajectory-based reasoning, and adversarial settings.

WHY NOW

AI Assistants moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability, although the paper record does not yet carry a public code link.

Opportunity summary

Score: 7.0

Pain: A benchmark for AI assistants to understand user emotions based on past interactions, enabling more personalized responses.

Evidence: 17 refs | 3 sources | 67% coverage

Blocker: no shell-level blocker reported

Competitive landscape

Segment: AI Assistants

Adoption evidence: No public code link in the paper record yet

Commercial read: 7.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified

Source: https://arxiv.org/abs/2604.07017 · Published 2026-04-08 12:36:18 UTC · Free to access
