ARXIV:2606.03312 · ROBOTICS · SUBMITTED 03 JUN · 20:43 UTC · FRESHNESS FRESH

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

RobotValues: Evaluating Household Robots When Human Values Conflict

Jongwook Han · Hyeongjin Kim · Yohan Jo · arXiv

RobotValues is a benchmark for evaluating household robots in value-conflicting scenarios, revealing biases in current VLMs and highlighting the need for prioritizing human values beyond task completion.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain RobotValues is a benchmark for evaluating household robots in value-conflicting scenarios, revealing biases in current VLMs and highlighting the need for prioritizing human values beyond task completion.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

METHOD

Full abstract

While household robots are often evaluated based on task completion, everyday domestic environments involve value-conflicting situations in which robots are expected to choose actions that prioritize other values than task success, such as human autonomy, efficiency, or social appropriateness. Yet, there are no benchmarks for evaluating robots' value preferences in such scenarios. We introduce RobotValues, a benchmark to evaluate household robot planners in 10K value-conflict scenarios. Each instance consists of a realistic household image with multiple plausible robot actions that prioritize different human values. We construct RobotValues through LLM-assisted scenario generation, stakeholder-grounded value extraction, image generation and automatic quality control. Using RobotValues we evaluate VLMs used in robotics and find that models exhibit default value preferences, including safety and accommodation, while underselecting privacy-prioritizing actions. When the models are instructed to prioritize specific values that conflict with their own preferences, they often fail to override their default actions, choosing incorrect actions for 80% of the time. These findings suggest that household robot evaluation should measure not only task completion or safety compliance, but also whether robots can choose among plausible actions when human values conflict.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. These findings suggest that household robot evaluation should measure not only task completion or safety compliance, but also whether robots can choose among plausible…

WHY NOW

Robotics moved forward this cycle; last verified June 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainRobotValues is a benchmark for evaluating household robots in value-conflicting scenarios, revealing biases in current VLMs and highlighting the need for prioritizing human values beyond task completion.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Segment

Robotics

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "37a54a1e-b79b-4953-a8c7-5c4ba7697477", "arxiv_id": "2606.03312", "canonical_route": "/paper/robotvalues-evaluating-household-robots-when-human-values-conflict", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "robotvalues-evaluating-household-robots-when-human-values-conflict", "endpoints": { "paper_pack": "/api/v1/paper/robotvalues-evaluating-household-robots-when-human-values-conflict/paper-pack", "build_passport": "/api/v1/paper/robotvalues-evaluating-household-robots-when-human-values-conflict/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "RobotValues: Evaluating Household Robots When Human Values Conflict", "normalized_query": "2606.03312", "route": "/paper/robotvalues-evaluating-household-robots-when-human-values-conflict", "paper_ref": "robotvalues-evaluating-household-robots-when-human-values-conflict", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/robotvalues-evaluating-household-robots-when-human-values-conflict#webpage", "url": "https://sciencetostartup.com/paper/robotvalues-evaluating-household-robots-when-human-values-conflict", "name": "RobotValues: Evaluating Household Robots When Human Values Conflict", "description": "RobotValues is a benchmark for evaluating household robots in value-conflicting scenarios, revealing biases in current VLMs and highlighting the need for prioritizing human values beyond task completion.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/robotvalues-evaluating-household-robots-when-human-values-conflict#scholarlyArticle", "headline": "RobotValues: Evaluating Household Robots When Human Values Conflict", "description": "RobotValues is a benchmark for evaluating household robots in value-conflicting scenarios, revealing biases in current VLMs and highlighting the need for prioritizing human values beyond task completion.", "url": "https://sciencetostartup.com/paper/robotvalues-evaluating-household-robots-when-human-values-conflict", "sameAs": "https://arxiv.org/abs/2606.03312", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2606.03312" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-06-02T08:25:01.000Z", "author": [ { "@type": "Person", "name": "Jongwook Han" }, { "@type": "Person", "name": "Hyeongjin Kim" }, { "@type": "Person", "name": "Yohan Jo" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Robotics" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Robotics", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "RobotValues: Evaluating Household Robots When Human Values C", "item": "https://sciencetostartup.com/paper/robotvalues-evaluating-household-robots-when-human-values-conflict" } ] } ] }

Competitive landscape

Segment

Robotics

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

RobotValues: Evaluating Household Robots When Human Values Conflict

RobotValues: Evaluating Household Robots When Human Values Conflict

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline