ARXIV:2604.21152 · LLM BIAS · SUBMITTED 24 APR · 20:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Dialect vs Demographics: Quantifying LLM Bias from Implicit Linguistic Signals vs. Explicit User Profiles

Irti Haq · Belén Saldías · arXiv

Quantifying LLM bias by disentangling implicit linguistic signals from explicit user profiles to reveal safety paradoxes.

Ship in 2-4 weeks›Score6.0Evidence unverified

Opportunity summary

Pain Quantifying LLM bias by disentangling implicit linguistic signals from explicit user profiles to reveal safety paradoxes.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Quantifying LLM bias by disentangling implicit linguistic signals from explicit user profiles to reveal safety paradoxes. However, it remains unclear whether these disparities arise from the explicitly stated identity itself or from the way…

METHOD

Full abstract

As state-of-the-art Large Language Models (LLMs) have become ubiquitous, ensuring equitable performance across diverse demographics is critical. However, it remains unclear whether these disparities arise from the explicitly stated identity itself or from the way identity is signaled. In real-world interactions, users' identity is often conveyed implicitly through a complex combination of various socio-linguistic factors. This study disentangles these signals by employing a factorial design with over 24,000 responses from two open-weight LLMs (Gemma-3-12B and Qwen-3-VL-8B), comparing prompts with explicitly announced user profiles against implicit dialect signals (e.g., AAVE, Singlish) across various sensitive domains. Our results uncover a unique paradox in LLM safety where users achieve ``better'' performance by sounding like a demographic than by stating they belong to it. Explicit identity prompts activate aggressive safety filters, increasing refusal rates and reducing semantic similarity compared to our reference text for Black users. In contrast, implicit dialect cues trigger a powerful ``dialect jailbreak,'' reducing refusal probability to near zero while simultaneously achieving a greater level of semantic similarity to the reference texts compared to Standard American English prompts. However, this ``dialect jailbreak'' introduces a critical safety trade-off regarding content sanitization. We find that current safety alignment techniques are brittle and over-indexed on explicit keywords, creating a bifurcated user experience where ``standard'' users receive cautious, sanitized information while dialect speakers navigate a less sanitized, more raw, and potentially a more hostile information landscape and highlights a fundamental tension in alignment--between equitable and linguistic diversity--and underscores the need for safety mechanisms that generalize beyond explicit cues.

RESULT

ScienceToStartup currently rates this 6.0/10 on the public viability pass. Our results uncover a unique paradox in LLM safety where users achieve ``better'' performance by sounding like a demographic than by stating they belong…

WHY NOW

LLM Bias moved forward this cycle; last verified April 2026. Public score 6.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score6.0

PainQuantifying LLM bias by disentangling implicit linguistic signals from explicit user profiles to reveal safety paradoxes.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

Quantifying LLM bias by disentangling implicit linguistic signals from explicit user profiles to reveal safety paradoxes.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

Quantifying LLM bias by disentangling implicit linguistic signals from explicit user profiles to reveal safety paradoxes.

Segment

LLM Bias

Adoption evidence

No public code link in the paper record yet

Commercial read

6.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "a44c71e1-2f23-495a-a581-d2a4d26af1e0", "arxiv_id": "2604.21152", "canonical_route": "/paper/dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles", "endpoints": { "paper_pack": "/api/v1/paper/dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles/paper-pack", "build_passport": "/api/v1/paper/dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Dialect vs Demographics: Quantifying LLM Bias from Implicit Linguistic Signals vs. Explicit User Profiles", "normalized_query": "2604.21152", "route": "/paper/dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles", "paper_ref": "dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles#webpage", "url": "https://sciencetostartup.com/paper/dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles", "name": "Dialect vs Demographics: Quantifying LLM Bias from Implicit Linguistic Signals vs. Explicit User Profiles", "description": "Quantifying LLM bias by disentangling implicit linguistic signals from explicit user profiles to reveal safety paradoxes.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles#scholarlyArticle", "headline": "Dialect vs Demographics: Quantifying LLM Bias from Implicit Linguistic Signals vs. Explicit User Profiles", "description": "Quantifying LLM bias by disentangling implicit linguistic signals from explicit user profiles to reveal safety paradoxes.", "url": "https://sciencetostartup.com/paper/dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles", "sameAs": "https://arxiv.org/abs/2604.21152", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.21152" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-22T23:33:18.000Z", "author": [ { "@type": "Person", "name": "Irti Haq" }, { "@type": "Person", "name": "Belén Saldías" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 6 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Bias" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Bias", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Dialect vs Demographics: Quantifying LLM Bias from Implicit ", "item": "https://sciencetostartup.com/paper/dialect-vs-demographics-quantifying-llm-bias-from-implicit-linguistic-signals-vs-explicit-user-profiles" } ] } ] }

Competitive landscape

Quantifying LLM bias by disentangling implicit linguistic signals from explicit user profiles to reveal safety paradoxes.

Segment

LLM Bias

Adoption evidence

No public code link in the paper record yet

Commercial read

6.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Dialect vs Demographics: Quantifying LLM Bias from Implicit Linguistic Signals vs. Explicit User Profiles

Dialect vs Demographics: Quantifying LLM Bias from Implicit Linguistic Signals vs. Explicit User Profiles

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline