Evidence Receipt. Related Resources.
Evidence Receipt. Related Resources.
Compared to this week’s papers
Verification pending
Use This Via API or MCP
Signal Canvas is the citation-first public layer for turning one paper into a structured commercialization narrative. Use it to hand off into REST, MCP, Build Loop, and launch-pack execution without losing source lineage.
Use This Via API or MCP
Route this paper proof surface into REST, MCP, or developer workflows while preserving the same evidence receipt and related-resource context.
Page Freshness
Canonical route: /signal-canvas/when-names-change-verdicts-intervention-consistency-reveals-systematic-bias-in-llm-decision-making
This page is showing the last landed evidence receipt and score bundle because the latest proof data is outside the freshness window.
Agent Handoff
Canonical ID when-names-change-verdicts-intervention-consistency-reveals-systematic-bias-in-llm-decision-making | Route /signal-canvas/when-names-change-verdicts-intervention-consistency-reveals-systematic-bias-in-llm-decision-making
REST example
curl https://sciencetostartup.com/api/v1/agent-handoff/signal-canvas/when-names-change-verdicts-intervention-consistency-reveals-systematic-bias-in-llm-decision-makingMCP example
{
"tool": "search_signal_canvas",
"arguments": {
"mode": "paper",
"paper_ref": "when-names-change-verdicts-intervention-consistency-reveals-systematic-bias-in-llm-decision-making",
"query_text": "Summarize When Names Change Verdicts: Intervention Consistency Reveals Systematic Bias in LLM Decision-Making"
}
}source_context
{
"surface": "signal_canvas",
"mode": "paper",
"query": "When Names Change Verdicts: Intervention Consistency Reveals Systematic Bias in LLM Decision-Making",
"normalized_query": "2603.18530",
"route": "/signal-canvas/when-names-change-verdicts-intervention-consistency-reveals-systematic-bias-in-llm-decision-making",
"paper_ref": "when-names-change-verdicts-intervention-consistency-reveals-systematic-bias-in-llm-decision-making",
"topic_slug": null,
"benchmark_ref": null,
"dataset_ref": null
}Claims: 7
References: Pending verification
Proof: Verification pending
Freshness state: computing
Source paper: When Names Change Verdicts: Intervention Consistency Reveals Systematic Bias in LLM Decision-Making
PDF: https://arxiv.org/pdf/2603.18530v1
Source count: Pending verification
Coverage: 17%
Last proof check: 2026-04-02T02:30:40.136Z
Signal Canvas receipt window
/buildability/when-names-change-verdicts-intervention-consistency-reveals-systematic-bias-in-llm-decision-making
Subject: When Names Change Verdicts: Intervention Consistency Reveals Systematic Bias in LLM Decision-Making
Verdict
Watch
Verdict is Watch because viability or proof quality is intermediate and should be re-evaluated before execution.
Preparing verified analysis
Dimensions overall score 8.0
No public code linked for this paper yet.
We introduce ICE-Guard, a framework applying intervention consistency testing to detect three types of spurious feature reliance: demographic (name/race swaps), authority (credential/prestige swaps), and framing (positive/negative restatements).
The abstract explicitly introduces ICE-Guard and its purpose.
partial
we find that (1) authority bias (mean 5.8%) and framing bias (5.0%) substantially exceed demographic bias (2.2%), challenging the field's narrow focus on demographics;
The abstract provides specific percentages for each bias type, directly comparing them.
partial
(2) bias concentrates in specific domains -- finance shows 22.6% authority bias while criminal justice shows only 2.8%;
The abstract provides specific domain examples and their corresponding bias percentages.
partial
(3) structured decomposition, where the LLM extracts features and a deterministic rubric decides, reduces flip rates by up to 100% (median 49% across 9 models).
The abstract quantifies the reduction in flip rates achieved by structured decomposition.
partial
We demonstrate an ICE-guided detect-diagnose-mitigate-verify loop achieving cumulative 78% bias reduction via iterative prompt patching.
The abstract states the cumulative bias reduction achieved by the proposed loop.
partial
Validation against real COMPAS recidivism data shows COMPAS-derived flip rates exceed pooled synthetic rates, suggesting our benchmark provides a conservative estimate of real-world bias.
The abstract directly compares real and synthetic data flip rates and draws a conclusion about the benchmark's conservatism.
partial
Across 3,000 vignettes spanning 10 high-stakes domains, we evaluate 11 LLMs from 8 families
The abstract explicitly states the number of LLMs and families evaluated.
partial
Related resources will appear here when this paper maps cleanly to topic, benchmark, or dataset surfaces.
Use an AI coding agent to implement this research.
Lightweight coding agent in your terminal.
Agentic coding tool for terminal workflows.
AI agent mindset installer and workflow scaffolder.
AI-first code editor built on VS Code.
Free, open-source editor by Microsoft.
Estimated $10K - $14K over 6-10 weeks.
See exactly what it costs to build this -- with 3 comparable funded startups.
7-day free trial. Cancel anytime.
Discover the researchers behind this paper and find similar experts.
7-day free trial. Cancel anytime.
Time to first demo
Insufficient data
No first-demo timestamp, owner estimate, or elapsed demo receipt is attached to this surface.
Structured compute envelope
Insufficient data
No data, compute, hardware, memory, latency, dependency, or serving requirement receipt is attached.
Receipt path
/buildability/when-names-change-verdicts-intervention-consistency-reveals-systematic-bias-in-llm-decision-making
Paper ref
when-names-change-verdicts-intervention-consistency-reveals-systematic-bias-in-llm-decision-making
arXiv id
2603.18530
Generated at
2026-04-02T02:30:40.136Z
Evidence freshness
stale
Last verification
2026-04-02T02:30:40.136Z
Sources
0
References
0
Coverage
17%
Lineage hash
79b6650590a7ea40b91e9c58468aab6d1822fdbe425a5ecc28e2560095048fd3
Canonical opportunity-kernel lineage hash.
External signature
unsigned_external
No founder, registry, pilot, or production-adoption signature is attached to this receipt.
Verification
not_verified
Verification is blocked until an external signature is provided.
Verification pending / evidence receipt incomplete
repo_url
references