ARXIV:2602.10117 · BIAS DETECTION · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Biases in the Blind Spot: Detecting What LLMs Fail to Mention

arXiv

Automated pipeline to uncover task-specific biases in LLMs without predefined categories.

Blocked on Code›Score6.0Evidence unverified

Opportunity summary

Pain Automated pipeline to uncover task-specific biases in LLMs without predefined categories.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Automated pipeline to uncover task-specific biases in LLMs without predefined categories. We call these *unverbalized biases*.

METHOD

Large Language Models (LLMs) often provide chain-of-thought (CoT) reasoning traces that appear plausible, but may hide internal biases. We call these *unverbalized biases*.

Full abstract

Large Language Models (LLMs) often provide chain-of-thought (CoT) reasoning traces that appear plausible, but may hide internal biases. We call these *unverbalized biases*. Monitoring models via their stated reasoning is therefore unreliable, and existing bias evaluations typically require predefined categories and hand-crafted datasets. In this work, we introduce a fully automated, black-box pipeline for detecting task-specific unverbalized biases. Given a task dataset, the pipeline uses LLM autoraters to generate candidate bias concepts. It then tests each concept on progressively larger input samples by generating positive and negative variations, and applies statistical techniques for multiple testing and early stopping. A concept is flagged as an unverbalized bias if it yields statistically significant performance differences while not being cited as justification in the model's CoTs. We evaluate our pipeline across six LLMs on three decision tasks (hiring, loan approval, and university admissions). Our technique automatically discovers previously unknown biases in these models (e.g., Spanish fluency, English proficiency, writing formality). In the same run, the pipeline also validates biases that were manually identified by prior work (gender, race, religion, ethnicity). More broadly, our proposed approach provides a practical, scalable path to automatic task-specific bias discovery.

RESULT

ScienceToStartup currently rates this 6.0/10 on the public viability pass. More broadly, our proposed approach provides a practical, scalable path to automatic task-specific bias discovery.

WHY NOW

Bias Detection moved forward this cycle; last verified April 2026. Public score 6.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score6.0

PainAutomated pipeline to uncover task-specific biases in LLMs without predefined categories.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

Automated pipeline to uncover task-specific biases in LLMs without predefined categories.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

References(23)

Reference metadata pending (156d890219f3ed2e393516ee56db5ed95fd8b115)

Reference metadata pending (7821a00468713f478293237078e73cb58d47737b)

Reference metadata pending (fc3fd70f768564c21d13862ba4209980e3b243e1)

Reference metadata pending (e71ff5188cc6435f0ba3ebbb054829c0b1dd3ba8)

Reference metadata pending (fb04470af15b10abc57f0b7d5c3e8614fcc56b21)

Reference metadata pending (3f0455e9dc4bca6a80bf7c50cd1279249bc65abc)

Reference metadata pending (3d7afcc9720aa11b7b1b08701c2ab289538ec546)

Reference metadata pending (59faaf0c63d2e4281d985e373b286c517577a933)

Reference metadata pending (7a63385cfdb5c7ecd6b78e3eadb832c4b92ba62b)

Reference metadata pending (d644a0849c1ab78e2d33ca762ede3be77e2eb294)

Reference metadata pending (3d8a3517231643c1df79bc32c8c2664a4cba3a41)

Reference metadata pending (a0a79dad89857a96f8f71b14238e5237cbfc4787)

Reference metadata pending (a66ade2f872e726e4ea58278058c4b6df4cbc2be)

Reference metadata pending (7dc928f41e15f65f1267bd87b0fcfcc7e715cb56)

Reference metadata pending (1b6e810ce0afd0dd093f789d2b2742d047e316d5)

Reference metadata pending (d47a682723f710395454687319bb55635e653105)

Reference metadata pending (3166c3e50a161ec4807812776b874baca991ce68)

Reference metadata pending (d40fee01a7708099f9e9392f10ac0b370b7ed8a8)

Reference metadata pending (7e7343a5608fff1c68c5259db0c77b9193f1546d)

Reference metadata pending (9e463eefadbcd336c69270a299666e4104d50159)

Showing 20 of 23 references

{ "contract_version": "paper-r2", "paper_id": "8b30d62f-ab35-46b6-96c9-a2f23a2da351", "arxiv_id": "2602.10117", "canonical_route": "/paper/biases-in-the-blind-spot-detecting-what-llms-fail-to-mention", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "biases-in-the-blind-spot-detecting-what-llms-fail-to-mention", "endpoints": { "paper_pack": "/api/v1/paper/biases-in-the-blind-spot-detecting-what-llms-fail-to-mention/paper-pack", "build_passport": "/api/v1/paper/biases-in-the-blind-spot-detecting-what-llms-fail-to-mention/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Biases in the Blind Spot: Detecting What LLMs Fail to Mention", "normalized_query": "2602.10117", "route": "/paper/biases-in-the-blind-spot-detecting-what-llms-fail-to-mention", "paper_ref": "biases-in-the-blind-spot-detecting-what-llms-fail-to-mention", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/biases-in-the-blind-spot-detecting-what-llms-fail-to-mention#webpage", "url": "https://sciencetostartup.com/paper/biases-in-the-blind-spot-detecting-what-llms-fail-to-mention", "name": "Biases in the Blind Spot: Detecting What LLMs Fail to Mention", "description": "Automated pipeline to uncover task-specific biases in LLMs without predefined categories.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/biases-in-the-blind-spot-detecting-what-llms-fail-to-mention#scholarlyArticle", "headline": "Biases in the Blind Spot: Detecting What LLMs Fail to Mention", "description": "Automated pipeline to uncover task-specific biases in LLMs without predefined categories.", "url": "https://sciencetostartup.com/paper/biases-in-the-blind-spot-detecting-what-llms-fail-to-mention", "sameAs": "https://arxiv.org/abs/2602.10117", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2602.10117" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-02-10T18:59:56.000Z", "citation": [ { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "156d890219f3ed2e393516ee56db5ed95fd8b115" }, "url": "https://www.semanticscholar.org/paper/156d890219f3ed2e393516ee56db5ed95fd8b115" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "7821a00468713f478293237078e73cb58d47737b" }, "url": "https://www.semanticscholar.org/paper/7821a00468713f478293237078e73cb58d47737b" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "fc3fd70f768564c21d13862ba4209980e3b243e1" }, "url": "https://www.semanticscholar.org/paper/fc3fd70f768564c21d13862ba4209980e3b243e1" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "e71ff5188cc6435f0ba3ebbb054829c0b1dd3ba8" }, "url": "https://www.semanticscholar.org/paper/e71ff5188cc6435f0ba3ebbb054829c0b1dd3ba8" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "fb04470af15b10abc57f0b7d5c3e8614fcc56b21" }, "url": "https://www.semanticscholar.org/paper/fb04470af15b10abc57f0b7d5c3e8614fcc56b21" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "3f0455e9dc4bca6a80bf7c50cd1279249bc65abc" }, "url": "https://www.semanticscholar.org/paper/3f0455e9dc4bca6a80bf7c50cd1279249bc65abc" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "3d7afcc9720aa11b7b1b08701c2ab289538ec546" }, "url": "https://www.semanticscholar.org/paper/3d7afcc9720aa11b7b1b08701c2ab289538ec546" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "59faaf0c63d2e4281d985e373b286c517577a933" }, "url": "https://www.semanticscholar.org/paper/59faaf0c63d2e4281d985e373b286c517577a933" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "7a63385cfdb5c7ecd6b78e3eadb832c4b92ba62b" }, "url": "https://www.semanticscholar.org/paper/7a63385cfdb5c7ecd6b78e3eadb832c4b92ba62b" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "d644a0849c1ab78e2d33ca762ede3be77e2eb294" }, "url": "https://www.semanticscholar.org/paper/d644a0849c1ab78e2d33ca762ede3be77e2eb294" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "3d8a3517231643c1df79bc32c8c2664a4cba3a41" }, "url": "https://www.semanticscholar.org/paper/3d8a3517231643c1df79bc32c8c2664a4cba3a41" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "a0a79dad89857a96f8f71b14238e5237cbfc4787" }, "url": "https://www.semanticscholar.org/paper/a0a79dad89857a96f8f71b14238e5237cbfc4787" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "a66ade2f872e726e4ea58278058c4b6df4cbc2be" }, "url": "https://www.semanticscholar.org/paper/a66ade2f872e726e4ea58278058c4b6df4cbc2be" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "7dc928f41e15f65f1267bd87b0fcfcc7e715cb56" }, "url": "https://www.semanticscholar.org/paper/7dc928f41e15f65f1267bd87b0fcfcc7e715cb56" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "1b6e810ce0afd0dd093f789d2b2742d047e316d5" }, "url": "https://www.semanticscholar.org/paper/1b6e810ce0afd0dd093f789d2b2742d047e316d5" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "d47a682723f710395454687319bb55635e653105" }, "url": "https://www.semanticscholar.org/paper/d47a682723f710395454687319bb55635e653105" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "3166c3e50a161ec4807812776b874baca991ce68" }, "url": "https://www.semanticscholar.org/paper/3166c3e50a161ec4807812776b874baca991ce68" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "d40fee01a7708099f9e9392f10ac0b370b7ed8a8" }, "url": "https://www.semanticscholar.org/paper/d40fee01a7708099f9e9392f10ac0b370b7ed8a8" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "7e7343a5608fff1c68c5259db0c77b9193f1546d" }, "url": "https://www.semanticscholar.org/paper/7e7343a5608fff1c68c5259db0c77b9193f1546d" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "9e463eefadbcd336c69270a299666e4104d50159" }, "url": "https://www.semanticscholar.org/paper/9e463eefadbcd336c69270a299666e4104d50159" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 6 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Bias Detection" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Bias Detection", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Biases in the Blind Spot: Detecting What LLMs Fail to Mentio", "item": "https://sciencetostartup.com/paper/biases-in-the-blind-spot-detecting-what-llms-fail-to-mention" } ] } ] }