ARXIV:2603.28213 · LLM FAIRNESS AND NON-STANDARD LANGUAGE · SUBMITTED 31 MAR · 20:30 UTC · FRESHNESS STALE

Verified · Source: PDF linked | Verified · PaperPack: citation fields available | Partial · Proof: unverified proof status

Versteasch du mi? Computational and Socio-Linguistic Perspectives on GenAI, LLMs, and Non-Standard Language

Verena Platzgummer · John McCrae · Sina Ahmadi · arXiv

Developing LLMs that can understand and process non-standard dialects to bridge the digital language divide.

Ship in 2-4 weeks › Score 4.0 · Evidence unverified

Opportunity summary

Pain Developing LLMs that can understand and process non-standard dialects to bridge the digital language divide.

Evidence 31 refs | 3 sources | 67% coverage

Blocker Evidence unverified

PROBLEM

Developing LLMs that can understand and process non-standard dialects to bridge the digital language divide. Critical sociolinguistic work has also argued that these technologies are not only made possible by prior socio-historical processes of…

METHOD

Full abstract

The design of Large Language Models and generative artificial intelligence has been shown to be "unfair" to less-spoken languages and to deepen the digital language divide. Critical sociolinguistic work has also argued that these technologies are not only made possible by prior socio-historical processes of linguistic standardisation, often grounded in European nationalist and colonial projects, but also exacerbate epistemologies of language as "monolithic, monolingual, syntactically standardized systems of meaning". In our paper, we draw on earlier work on the intersections of technology and language policy and bring our respective expertise in critical sociolinguistics and computational linguistics to bear on an interrogation of these arguments. We take two different complexes of non-standard linguistic varieties in our respective repertoires--South Tyrolean dialects, which are widely used in informal communication in South Tyrol, Italy, as well as varieties of Kurdish--as starting points to an interdisciplinary exploration of the intersections between GenAI and linguistic variation and standardisation. We discuss both how LLMs can be made to deal with nonstandard language from a technical perspective, and whether, when or how this can contribute to "democratic and decolonial digital and machine learning strategies", which has direct policy implications.

RESULT

ScienceToStartup currently rates this 4.0/10 on the public viability pass. We discuss both how LLMs can be made to deal with nonstandard language from a technical perspective, and whether, when or how this can…

WHY NOW

LLM Fairness and Non-Standard Language moved forward this cycle; last verified April 2026. Public score 4.0/10. Production flags indicate code availability.

Opportunity summary

Score 4.0

Pain Developing LLMs that can understand and process non-standard dialects to bridge the digital language divide.

Evidence 31 refs | 3 sources | 67% coverage

Blocker no shell-level blocker reported

Competitive landscape

Developing LLMs that can understand and process non-standard dialects to bridge the digital language divide.

Segment: LLM Fairness and Non-Standard Language

Adoption evidence: No public code link in the paper record yet

Commercial read: 4.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified

Source record: https://arxiv.org/abs/2603.28213 · datePublished 2026-03-30T09:34:41.000Z · free to access
