ARXIV:2602.22452 · AGENTS · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

CWM: Contrastive World Models for Action Feasibility Learning in Embodied Agent Pipelines

arXiv

Develop action feasibility scorers for embodied agents using contrastive learning to improve reliability and safety in AI action planning.

Blocked on Code›Score6.0Evidence unverified

Opportunity summary

Pain Develop action feasibility scorers for embodied agents using contrastive learning to improve reliability and safety in AI action planning.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Develop action feasibility scorers for embodied agents using contrastive learning to improve reliability and safety in AI action planning. Existing approaches use supervised fine-tuning (SFT) to train action scorers, but SFT treats each candidate…

METHOD

Full abstract

A reliable action feasibility scorer is a critical bottleneck in embodied agent pipelines: before any planning or reasoning occurs, the agent must identify which candidate actions are physically executable in the current state. Existing approaches use supervised fine-tuning (SFT) to train action scorers, but SFT treats each candidate independently and does not explicitly teach the model to discriminate between actions that are physically correct and those that are subtly wrong. We propose the Contrastive World Model (CWM), which fine-tunes a large language model (LLM) as an action scorer using an InfoNCE contrastive objective with hard-mined negative examples. The key idea is to push valid actions away from invalid ones in scoring space, with special emphasis on hard negatives: semantically similar but physically incompatible candidates. We evaluate CWM on the ScienceWorld benchmark through two studies. First, an intrinsic affordance evaluation on 605 hard-negative test pairs shows that CWM outperforms SFT by +6.76 percentage points on Precision@1 for minimal-edit negatives -- cases where a single word changes the physical outcome -- and achieves a higher AUC-ROC (0.929 vs. 0.906). Second, a live filter characterisation study measures how well CWM ranks gold-path actions against all valid environment actions during task execution. Under out-of-distribution stress conditions, CWM maintains a significantly better safety margin (-2.39) than SFT (-3.96), indicating that the gold action is ranked closer to the top. These results support the hypothesis that contrastive training induces representations that capture physical feasibility more faithfully than SFT alone.

RESULT

ScienceToStartup currently rates this 6.0/10 on the public viability pass. First, an intrinsic affordance evaluation on 605 hard-negative test pairs shows that CWM outperforms SFT by +6.76 percentage points on Precision@1 for minimal-edit negatives…

WHY NOW

Agents moved forward this cycle; last verified April 2026. Public score 6.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score6.0

PainDevelop action feasibility scorers for embodied agents using contrastive learning to improve reliability and safety in AI action planning.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

Develop action feasibility scorers for embodied agents using contrastive learning to improve reliability and safety in AI action planning.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

Develop action feasibility scorers for embodied agents using contrastive learning to improve reliability and safety in AI action planning.

Segment

Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

6.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "ebe32316-a13c-4f63-976d-b543c3ba9309", "arxiv_id": "2602.22452", "canonical_route": "/paper/cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines", "endpoints": { "paper_pack": "/api/v1/paper/cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines/paper-pack", "build_passport": "/api/v1/paper/cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "CWM: Contrastive World Models for Action Feasibility Learning in Embodied Agent Pipelines", "normalized_query": "2602.22452", "route": "/paper/cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines", "paper_ref": "cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines#webpage", "url": "https://sciencetostartup.com/paper/cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines", "name": "CWM: Contrastive World Models for Action Feasibility Learning in Embodied Agent Pipelines", "description": "Develop action feasibility scorers for embodied agents using contrastive learning to improve reliability and safety in AI action planning.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines#scholarlyArticle", "headline": "CWM: Contrastive World Models for Action Feasibility Learning in Embodied Agent Pipelines", "description": "Develop action feasibility scorers for embodied agents using contrastive learning to improve reliability and safety in AI action planning.", "url": "https://sciencetostartup.com/paper/cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines", "sameAs": "https://arxiv.org/abs/2602.22452", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2602.22452" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-02-25T22:27:30.000Z", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 6 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Agents" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Agents", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "CWM: Contrastive World Models for Action Feasibility Learnin", "item": "https://sciencetostartup.com/paper/cwm-contrastive-world-models-for-action-feasibility-learning-in-embodied-agent-pipelines" } ] } ] }

Competitive landscape

Develop action feasibility scorers for embodied agents using contrastive learning to improve reliability and safety in AI action planning.

Segment

Agents

Adoption evidence

No public code link in the paper record yet

Commercial read

6.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

CWM: Contrastive World Models for Action Feasibility Learning in Embodied Agent Pipelines

CWM: Contrastive World Models for Action Feasibility Learning in Embodied Agent Pipelines

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline