ARXIV:2604.27093 · LLM SAFETY & ALIGNMENT · SUBMITTED 01 MAY · 20:31 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Useless but Safe? Benchmarking Utility Recovery with User Intent Clarification in Multi-Turn Conversations

Mingqian Zheng · Malia Morgan · Liwei Jiang · Carolyn Rose · Maarten Sap · arXiv

A benchmark for evaluating if LLMs can safely recover utility by clarifying user intent in multi-turn conversations.

Ship in 2-4 weeks›Score5.0Evidence unverified

Opportunity summary

Pain A benchmark for evaluating if LLMs can safely recover utility by clarifying user intent in multi-turn conversations.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A benchmark for evaluating if LLMs can safely recover utility by clarifying user intent in multi-turn conversations. We introduce CarryOnBench, the first interactive benchmark that measures whether LLMs can revise their interpretation of user…

METHOD

Full abstract

Current LLM safety alignment techniques improve model robustness against adversarial attacks, but overlook whether and how LLMs can recover helpfulness when benign users clarify their intent. We introduce CarryOnBench, the first interactive benchmark that measures whether LLMs can revise their interpretation of user intent and recover utility, while remaining safe through multi-turn conversations. Starting from 398 seemingly harmful queries with benign underlying intents, we simulate 5,970 conversations by varying user follow-up sequences, evaluating 14 models on both intent-aligned utility and safety. CarryOnBench yields 1,866 different conversation flows of 4--12 turns, totaling 23,880 model responses. We design Ben-Util, a checklist-based metric that evaluates how well each model response fulfills the user's benign information need using atomic items. At turn one, models fulfill only 10.5--37.6% of the user's benign information need. When the same query includes the benign intent upfront, models fulfill 25.1--72.1%, confirming that models withhold information due to intent misinterpretation, not limited knowledge. With benign clarifications in multi-turn conversations, 13 of 14 models approach or exceed this single-turn baseline, yet recovery cost varies across models. We identify three failure modes invisible to single-turn evaluations: utility lock-in, where a model rarely updates despite clarification; unsafe recovery, where a model updates at disproportionate safety cost; and repetitive recovery, where a model recycles prior responses rather than providing new information. Moreover, conversations converge to similar harmfulness levels regardless of how conservative the model starts. These findings expose a gap that single-turn evaluations miss -- whether a model is appropriately cautious or simply unresponsive to clarified user intent.

RESULT

ScienceToStartup currently rates this 5.0/10 on the public viability pass. Current LLM safety alignment techniques improve model robustness against adversarial attacks, but overlook whether and how LLMs can recover helpfulness when benign users clarify…

WHY NOW

LLM Safety & Alignment moved forward this cycle; last verified May 2026. Public score 5.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score5.0

PainA benchmark for evaluating if LLMs can safely recover utility by clarifying user intent in multi-turn conversations.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A benchmark for evaluating if LLMs can safely recover utility by clarifying user intent in multi-turn conversations.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A benchmark for evaluating if LLMs can safely recover utility by clarifying user intent in multi-turn conversations.

Segment

LLM Safety & Alignment

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "4d5ed44e-8cc7-4140-8de0-d631cfbf1a51", "arxiv_id": "2604.27093", "canonical_route": "/paper/useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations", "endpoints": { "paper_pack": "/api/v1/paper/useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations/paper-pack", "build_passport": "/api/v1/paper/useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Useless but Safe? Benchmarking Utility Recovery with User Intent Clarification in Multi-Turn Conversations", "normalized_query": "2604.27093", "route": "/paper/useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations", "paper_ref": "useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations#webpage", "url": "https://sciencetostartup.com/paper/useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations", "name": "Useless but Safe? Benchmarking Utility Recovery with User Intent Clarification in Multi-Turn Conversations", "description": "A benchmark for evaluating if LLMs can safely recover utility by clarifying user intent in multi-turn conversations.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations#scholarlyArticle", "headline": "Useless but Safe? Benchmarking Utility Recovery with User Intent Clarification in Multi-Turn Conversations", "description": "A benchmark for evaluating if LLMs can safely recover utility by clarifying user intent in multi-turn conversations.", "url": "https://sciencetostartup.com/paper/useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations", "sameAs": "https://arxiv.org/abs/2604.27093", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.27093" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-29T18:37:18.000Z", "author": [ { "@type": "Person", "name": "Mingqian Zheng" }, { "@type": "Person", "name": "Malia Morgan" }, { "@type": "Person", "name": "Liwei Jiang" }, { "@type": "Person", "name": "Carolyn Rose" }, { "@type": "Person", "name": "Maarten Sap" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 5 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Safety & Alignment" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Safety & Alignment", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Useless but Safe? Benchmarking Utility Recovery with User In", "item": "https://sciencetostartup.com/paper/useless-but-safe-benchmarking-utility-recovery-with-user-intent-clarification-in-multi-turn-conversations" } ] } ] }

Competitive landscape

A benchmark for evaluating if LLMs can safely recover utility by clarifying user intent in multi-turn conversations.

Segment

LLM Safety & Alignment

Adoption evidence

No public code link in the paper record yet

Commercial read

5.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Useless but Safe? Benchmarking Utility Recovery with User Intent Clarification in Multi-Turn Conversations

Useless but Safe? Benchmarking Utility Recovery with User Intent Clarification in Multi-Turn Conversations

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline