ARXIV:2601.19106 · CODE CORRECTION TOOLS · SUBMITTED 19 MAR · 18:48 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Detecting and Correcting Hallucinations in LLM-Generated Code via Deterministic AST Analysis

arXiv

A deterministic AST-based tool to auto-correct semantic errors in LLM-generated code, enhancing reliability without runtime execution.

Blocked on Code›Score8.0Evidence unverified

Opportunity summary

Pain A deterministic AST-based tool to auto-correct semantic errors in LLM-generated code, enhancing reliability without runtime execution.

Evidence 0 refs | 0 sources | 33% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A deterministic AST-based tool to auto-correct semantic errors in LLM-generated code, enhancing reliability without runtime execution. Existing mitigations like constrained decoding or non-deterministic LLM-in-the-loop repair are often unreliable for these errors.

METHOD

Full abstract

Large Language Models (LLMs) for code generation boost productivity but frequently introduce Knowledge Conflicting Hallucinations (KCHs), subtle, semantic errors, such as non-existent API parameters, that evade linters and cause runtime failures. Existing mitigations like constrained decoding or non-deterministic LLM-in-the-loop repair are often unreliable for these errors. This paper investigates whether a deterministic, static-analysis framework can reliably detect \textit{and} auto-correct KCHs. We propose a post-processing framework that parses generated code into an Abstract Syntax Tree (AST) and validates it against a dynamically-generated Knowledge Base (KB) built via library introspection. This non-executing approach uses deterministic rules to find and fix both API and identifier-level conflicts. On a manually-curated dataset of 200 Python snippets, our framework detected KCHs with 100\% precision and 87.6\% recall (0.934 F1-score), and successfully auto-corrected 77.0\% of all identified hallucinations. Our findings demonstrate that this deterministic post-processing approach is a viable and reliable alternative to probabilistic repair, offering a clear path toward trustworthy code generation.

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. Our findings demonstrate that this deterministic post-processing approach is a viable and reliable alternative to probabilistic repair, offering a clear path toward trustworthy code…

WHY NOW

Code Correction Tools moved forward this cycle; last verified April 2026. Public score 8.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score8.0

PainA deterministic AST-based tool to auto-correct semantic errors in LLM-generated code, enhancing reliability without runtime execution.

Evidence0 refs | 0 sources | 33% coverage

Blockermissing authors

Analysis summary

A deterministic AST-based tool to auto-correct semantic errors in LLM-generated code, enhancing reliability without runtime execution.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

A deterministic AST-based tool to auto-correct semantic errors in LLM-generated code, enhancing reliability without runtime execution.

Segment

Code Correction Tools

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

References(5)

LLMLOOP: Improving LLM-Generated Code and Tests Through Automated Iterative Feedback Loops

2025Ravin Ravi, Dylan Bradshaw et al.

Static Analysis as a Feedback Loop: Enhancing LLM-Generated Code Beyond Correctness

2025Scott Blyth, Sherlock A. Licorish et al.

Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges

2025Yunseo Lee, John Youngeun Song et al.

The Impact of AI on Developer Productivity: Evidence from GitHub Copilot

2023Sida Peng, Eirini Kalliamvakou et al.

arXiv

2019Lucy Rosenbloom

{ "contract_version": "paper-r2", "paper_id": "18f31e4b-c450-44da-93f9-7d43f64d9171", "arxiv_id": "2601.19106", "canonical_route": "/paper/detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis", "endpoints": { "paper_pack": "/api/v1/paper/detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis/paper-pack", "build_passport": "/api/v1/paper/detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "Detecting and Correcting Hallucinations in LLM-Generated Code via Deterministic AST Analysis", "normalized_query": "2601.19106", "route": "/paper/detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis", "paper_ref": "detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis#webpage", "url": "https://sciencetostartup.com/paper/detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis", "name": "Detecting and Correcting Hallucinations in LLM-Generated Code via Deterministic AST Analysis", "description": "A deterministic AST-based tool to auto-correct semantic errors in LLM-generated code, enhancing reliability without runtime execution.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis#scholarlyArticle", "headline": "Detecting and Correcting Hallucinations in LLM-Generated Code via Deterministic AST Analysis", "description": "A deterministic AST-based tool to auto-correct semantic errors in LLM-generated code, enhancing reliability without runtime execution.", "url": "https://sciencetostartup.com/paper/detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis", "sameAs": "https://arxiv.org/abs/2601.19106", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2601.19106" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-01-27T02:16:37.000Z", "citation": [ { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "a7da0a5b7331a5e4a97fc944d2a8e84bafc42179" }, "url": "https://www.semanticscholar.org/paper/a7da0a5b7331a5e4a97fc944d2a8e84bafc42179" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "f02fb72c0c4dec27675363ec59510e8f0d809da5" }, "url": "https://www.semanticscholar.org/paper/f02fb72c0c4dec27675363ec59510e8f0d809da5" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "ab90297f5a95b79d67620c32d2474c9a1dfef8c8" }, "url": "https://www.semanticscholar.org/paper/ab90297f5a95b79d67620c32d2474c9a1dfef8c8" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "15abedb29536d50afeeec739a25358255cbda3e8" }, "url": "https://www.semanticscholar.org/paper/15abedb29536d50afeeec739a25358255cbda3e8" }, { "@type": "ScholarlyArticle", "identifier": { "@type": "PropertyValue", "propertyID": "SemanticScholar", "value": "f4327b978dec52f16b089c222c43543f8ecf4717" }, "url": "https://www.semanticscholar.org/paper/f4327b978dec52f16b089c222c43543f8ecf4717" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 8 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Code Correction Tools" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Code Correction Tools", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "Detecting and Correcting Hallucinations in LLM-Generated Cod", "item": "https://sciencetostartup.com/paper/detecting-and-correcting-hallucinations-in-llm-generated-code-via-deterministic-ast-analysis" } ] } ] }

Competitive landscape

A deterministic AST-based tool to auto-correct semantic errors in LLM-generated code, enhancing reliability without runtime execution.

Segment

Code Correction Tools

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

References(5)

LLMLOOP: Improving LLM-Generated Code and Tests Through Automated Iterative Feedback Loops

2025Ravin Ravi, Dylan Bradshaw et al.

Static Analysis as a Feedback Loop: Enhancing LLM-Generated Code Beyond Correctness

2025Scott Blyth, Sherlock A. Licorish et al.

Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges

2025Yunseo Lee, John Youngeun Song et al.

The Impact of AI on Developer Productivity: Evidence from GitHub Copilot

2023Sida Peng, Eirini Kalliamvakou et al.

arXiv

2019Lucy Rosenbloom

Detecting and Correcting Hallucinations in LLM-Generated Code via Deterministic AST Analysis

Detecting and Correcting Hallucinations in LLM-Generated Code via Deterministic AST Analysis

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(5)

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

References(5)

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline