ARXIV:2604.06863 · LLM BIAS DETECTION · SUBMITTED 10 APR · 00:14 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Digital Skin, Digital Bias: Uncovering Tone-Based Biases in LLMs and Emoji Embeddings

Mingchen Li · Wajdi Aljedaani · Yingjie Liu · Navyasri Meka · Xuan Lu · Xinyue Ye · +2 at arXiv

This paper identifies and quantifies tone-based biases in LLMs and emoji embeddings, providing a framework for auditing and mitigating representational harms in online communication.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain This paper identifies and quantifies tone-based biases in LLMs and emoji embeddings, providing a framework for auditing and mitigating representational harms in online communication.

Evidence 46 refs | 5 sources | 67% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

This paper identifies and quantifies tone-based biases in LLMs and emoji embeddings, providing a framework for auditing and mitigating representational harms in online communication. As AI models, particularly Large Language Models (LLMs), increasingly mediate…

METHOD

Full abstract

Skin-toned emojis are crucial for fostering personal identity and social inclusion in online communication. As AI models, particularly Large Language Models (LLMs), increasingly mediate interactions on web platforms, the risk that these systems perpetuate societal biases through their representation of such symbols is a significant concern. This paper presents the first large-scale comparative study of bias in skin-toned emoji representations across two distinct model classes. We systematically evaluate dedicated emoji embedding models (emoji2vec, emoji-sw2v) against four modern LLMs (Llama, Gemma, Qwen, and Mistral). Our analysis first reveals a critical performance gap: while LLMs demonstrate robust support for skin tone modifiers, widely-used specialized emoji models exhibit severe deficiencies. More importantly, a multi-faceted investigation into semantic consistency, representational similarity, sentiment polarity, and core biases uncovers systemic disparities. We find evidence of skewed sentiment and inconsistent meanings associated with emojis across different skin tones, highlighting latent biases within these foundational models. Our findings underscore the urgent need for developers and platforms to audit and mitigate these representational harms, ensuring that AI's role on the web promotes genuine equity rather than reinforcing societal biases.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Our analysis first reveals a critical performance gap: while LLMs demonstrate robust support for skin tone modifiers, widely-used specialized emoji models exhibit severe deficiencies.…

WHY NOW

LLM Bias Detection moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainThis paper identifies and quantifies tone-based biases in LLMs and emoji embeddings, providing a framework for auditing and mitigating representational harms in online communication.

Evidence46 refs | 5 sources | 67% coverage

Blockerno shell-level blocker reported

Analysis summary

This paper identifies and quantifies tone-based biases in LLMs and emoji embeddings, providing a framework for auditing and mitigating representational harms in online communication.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

This paper identifies and quantifies tone-based biases in LLMs and emoji embeddings, providing a framework for auditing and mitigating representational harms in online communication.

Segment

LLM Bias Detection

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "56a01583-7723-45e6-948a-d8838abf709b", "arxiv_id": "2604.06863", "canonical_route": "/paper/digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings", "endpoints": { "paper_pack": "/api/v1/paper/digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings/paper-pack", "build_passport": "/api/v1/paper/digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Digital Skin, Digital Bias: Uncovering Tone-Based Biases in LLMs and Emoji Embeddings", "normalized_query": "2604.06863", "route": "/paper/digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings", "paper_ref": "digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings#webpage", "url": "https://sciencetostartup.com/paper/digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings", "name": "Digital Skin, Digital Bias: Uncovering Tone-Based Biases in LLMs and Emoji Embeddings", "description": "This paper identifies and quantifies tone-based biases in LLMs and emoji embeddings, providing a framework for auditing and mitigating representational harms in online communication.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings#scholarlyArticle", "headline": "Digital Skin, Digital Bias: Uncovering Tone-Based Biases in LLMs and Emoji Embeddings", "description": "This paper identifies and quantifies tone-based biases in LLMs and emoji embeddings, providing a framework for auditing and mitigating representational harms in online communication.", "url": "https://sciencetostartup.com/paper/digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings", "sameAs": "https://arxiv.org/abs/2604.06863", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.06863" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-08T09:24:56.000Z", "author": [ { "@type": "Person", "name": "Mingchen Li" }, { "@type": "Person", "name": "Wajdi Aljedaani" }, { "@type": "Person", "name": "Yingjie Liu" }, { "@type": "Person", "name": "Navyasri Meka" }, { "@type": "Person", "name": "Xuan Lu" }, { "@type": "Person", "name": "Xinyue Ye" }, { "@type": "Person", "name": "Junhua Ding" }, { "@type": "Person", "name": "Yunhe Feng" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Bias Detection" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Bias Detection", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Digital Skin, Digital Bias: Uncovering Tone-Based Biases in ", "item": "https://sciencetostartup.com/paper/digital-skin-digital-bias-uncovering-tone-based-biases-in-llms-and-emoji-embeddings" } ] } ] }

Competitive landscape

This paper identifies and quantifies tone-based biases in LLMs and emoji embeddings, providing a framework for auditing and mitigating representational harms in online communication.

Segment

LLM Bias Detection

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Digital Skin, Digital Bias: Uncovering Tone-Based Biases in LLMs and Emoji Embeddings

Digital Skin, Digital Bias: Uncovering Tone-Based Biases in LLMs and Emoji Embeddings

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline