ARXIV:2605.30913 · LLM RELIABILITY · SUBMITTED 01 JUN · 20:32 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits

Soorya Ram Shimgekar · Agam Goyal · Amruta Parulekar · Joshua Chen · Yian Wang · Navin Kumar · +3 at arXiv

This research investigates how toxic language in prompts degrades LLM factual reliability and internal computation, finding that lexical toxicity significantly reduces accuracy and increases uncertainty.

Blocked on Code›Score3.0Evidence unverified

Opportunity summary

Pain This research investigates how toxic language in prompts degrades LLM factual reliability and internal computation, finding that lexical toxicity significantly reduces accuracy and increases uncertainty.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

METHOD

Full abstract

Large language models (LLMs) are increasingly deployed in conversational settings where user tone ranges from polite to adversarial or toxic, yet less is known about whether toxic language in otherwise semantically equivalent prompts can degrade factual reliability. We study how lexical and tone-based prompt perturbations affect the factual reliability of LLMs. Using controlled prompt variations across polite, random, and three toxicity levels, we evaluate five LLMs on ARC-Easy, GSM8K, and MMLU. We find that toxic lexical perturbations consistently reduce factual accuracy and increase uncertainty, while polite phrasing yields limited and inconsistent changes. To examine whether these answer inconsistencies correspond to internal changes, we conduct attribution-graph analyses of model activations and influences. We find that increasing toxicity selectively amplifies perturbation-sensitive variant nodes while relatively stable core reasoning nodes remain more invariant. These findings position prompt tone as a critical dimension of LLM reliability and provide behavioral and mechanistic evidence that surface-level lexical variation can alter factual outputs and internal computation.

RESULT

ScienceToStartup currently rates this 3.0/10 on the public viability pass. These findings position prompt tone as a critical dimension of LLM reliability and provide behavioral and mechanistic evidence that surface-level lexical variation can alter…

WHY NOW

LLM Reliability moved forward this cycle; last verified June 2026. Public score 3.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score3.0

PainThis research investigates how toxic language in prompts degrades LLM factual reliability and internal computation, finding that lexical toxicity significantly reduces accuracy and increases uncertainty.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits

Soorya Ram Shimgekar · Agam Goyal · Amruta Parulekar · Joshua Chen · Yian Wang · Navin Kumar · +3 at arXiv

Competitive landscape

Segment

LLM Reliability

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "d2b4718f-8f58-4ceb-a988-89e0b27e5640", "arxiv_id": "2605.30913", "canonical_route": "/paper/toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits", "endpoints": { "paper_pack": "/api/v1/paper/toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits/paper-pack", "build_passport": "/api/v1/paper/toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits", "normalized_query": "2605.30913", "route": "/paper/toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits", "paper_ref": "toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits#webpage", "url": "https://sciencetostartup.com/paper/toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits", "name": "Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits", "description": "This research investigates how toxic language in prompts degrades LLM factual reliability and internal computation, finding that lexical toxicity significantly reduces accuracy and increases uncertainty.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits#scholarlyArticle", "headline": "Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits", "description": "This research investigates how toxic language in prompts degrades LLM factual reliability and internal computation, finding that lexical toxicity significantly reduces accuracy and increases uncertainty.", "url": "https://sciencetostartup.com/paper/toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits", "sameAs": "https://arxiv.org/abs/2605.30913", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.30913" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-29T06:58:47.000Z", "author": [ { "@type": "Person", "name": "Soorya Ram Shimgekar" }, { "@type": "Person", "name": "Agam Goyal" }, { "@type": "Person", "name": "Amruta Parulekar" }, { "@type": "Person", "name": "Joshua Chen" }, { "@type": "Person", "name": "Yian Wang" }, { "@type": "Person", "name": "Navin Kumar" }, { "@type": "Person", "name": "Hari Sundaram" }, { "@type": "Person", "name": "Eshwar Chandrasekharan" }, { "@type": "Person", "name": "Koustuv Saha" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 3 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Reliability" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Reliability", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Ci", "item": "https://sciencetostartup.com/paper/toxic-hallucinaitions-perturbing-prompts-and-tracing-llm-circuits" } ] } ] }

Competitive landscape

Segment

LLM Reliability

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits

Toxic HallucinAItions: Perturbing Prompts and Tracing LLM Circuits

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline