ARXIV:2604.02669 · LLM ALIGNMENT & BIAS · SUBMITTED 06 APR · 20:15 UTC · FRESHNESS UNKNOWN

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments

Divyanshu Kumar · Ishita Gupta · Nitin Aravind Birur · Tanay Baswa · Sahil Agarwal · Prashanth Harshangi · arXiv

This research provides a novel, comprehensive framework for auditing LLM bias that reveals systematic mischaracterizations by current alignment practices, offering a path to more robust safety and fairness evaluations.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain This research provides a novel, comprehensive framework for auditing LLM bias that reveals systematic mischaracterizations by current alignment practices, offering a path to more robust safety and fairness evaluations.

Evidence 0 refs | 0 sources | 0% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

METHOD

How biased is a language model? The answer depends on how you ask.

Full abstract

How biased is a language model? The answer depends on how you ask. A model that refuses to choose between castes for a leadership role will, in a fill-in-the-blank task, reliably associate upper castes with purity and lower castes with lack of hygiene. Single-task benchmarks miss this because they capture only one slice of a model's bias profile. We introduce a hierarchical taxonomy covering 9 bias types, including under-studied axes like caste, linguistic, and geographic bias, operationalized through 7 evaluation tasks that span explicit decision-making to implicit association. Auditing 7 commercial and open-weight LLMs with \textasciitilde45K prompts, we find three systematic patterns. First, bias is task-dependent: models counter stereotypes on explicit probes but reproduce them on implicit ones, with Stereotype Score divergences up to 0.43 between task types for the same model and identity groups. Second, safety alignment is asymmetric: models refuse to assign negative traits to marginalized groups, but freely associate positive traits with privileged ones. Third, under-studied bias axes show the strongest stereotyping across all models, suggesting alignment effort tracks benchmark coverage rather than harm severity. These results demonstrate that single-benchmark audits systematically mischaracterize LLM bias and that current alignment practices mask representational harm rather than mitigating it.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Third, under-studied bias axes show the strongest stereotyping across all models, suggesting alignment effort tracks benchmark coverage rather than harm severity. Code availability is…

WHY NOW

LLM Alignment & Bias moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainThis research provides a novel, comprehensive framework for auditing LLM bias that reveals systematic mischaracterizations by current alignment practices, offering a path to more robust safety and fairness evaluations.

Evidence0 refs | 0 sources | 0% coverage

Blockerno shell-level blocker reported

Analysis summary

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments

Divyanshu Kumar · Ishita Gupta · Nitin Aravind Birur · Tanay Baswa · Sahil Agarwal · Prashanth Harshangi · arXiv

Competitive landscape

Segment

LLM Alignment & Bias

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "07b1afeb-9dbf-4d4c-b407-09492f4d199d", "arxiv_id": "2604.02669", "canonical_route": "/paper/redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments", "endpoints": { "paper_pack": "/api/v1/paper/redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments/paper-pack", "build_passport": "/api/v1/paper/redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments", "normalized_query": "2604.02669", "route": "/paper/redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments", "paper_ref": "redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments#webpage", "url": "https://sciencetostartup.com/paper/redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments", "name": "Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments", "description": "This research provides a novel, comprehensive framework for auditing LLM bias that reveals systematic mischaracterizations by current alignment practices, offering a path to more robust safety and fairness evaluations.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments#scholarlyArticle", "headline": "Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments", "description": "This research provides a novel, comprehensive framework for auditing LLM bias that reveals systematic mischaracterizations by current alignment practices, offering a path to more robust safety and fairness evaluations.", "url": "https://sciencetostartup.com/paper/redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments", "sameAs": "https://arxiv.org/abs/2604.02669", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.02669" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-03T03:03:21.000Z", "author": [ { "@type": "Person", "name": "Divyanshu Kumar" }, { "@type": "Person", "name": "Ishita Gupta" }, { "@type": "Person", "name": "Nitin Aravind Birur" }, { "@type": "Person", "name": "Tanay Baswa" }, { "@type": "Person", "name": "Sahil Agarwal" }, { "@type": "Person", "name": "Prashanth Harshangi" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Alignment & Bias" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Alignment & Bias", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Redirected, Not Removed: Task-Dependent Stereotyping Reveals", "item": "https://sciencetostartup.com/paper/redirected-not-removed-task-dependent-stereotyping-reveals-the-limits-of-llm-alignments" } ] } ] }

Competitive landscape

Segment

LLM Alignment & Bias

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments

Redirected, Not Removed: Task-Dependent Stereotyping Reveals the Limits of LLM Alignments

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline