ARXIV:2605.11738 · LLM AUDITING · SUBMITTED 13 MAY · 20:55 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

OptArgus: A Multi-Agent System to Detect Hallucinations in LLM-based Optimization Modeling

Zhong Li · Zihan Guo · Xiaohan Lu · Juntao Wang · Jie Song · Chao Shen · +2 at arXiv

A multi-agent system to detect and audit hallucinations in LLM-generated optimization models by checking structural consistency.

Ship in 2-4 weeks›Score7.0Evidence unverified

Opportunity summary

Pain A multi-agent system to detect and audit hallucinations in LLM-generated optimization models by checking structural consistency.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A multi-agent system to detect and audit hallucinations in LLM-generated optimization models by checking structural consistency. We formulate this issue as \emph{optimization-modeling hallucination detection}, namely structural consistency auditing over the problem description, symbolic model,…

METHOD

Full abstract

Large language models (LLMs) are increasingly used to translate natural-language optimization problems into mathematical formulations and solver code, but matching the reference objective value is not a reliable test of correctness: an artifact may agree numerically while still changing the underlying optimization semantics. We formulate this issue as \emph{optimization-modeling hallucination detection}, namely structural consistency auditing over the problem description, symbolic model, and solver implementation. We develop, to our knowledge, the first fine-grained hallucination taxonomy specifically for optimization modeling, spanning objective, variable, constraint, and implementation failures. We use this taxonomy to design OptArgus, a multi-agent detector with conductor routing, specialist auditors, and evidence consolidation. To evaluate this setting, we introduce a three-part benchmark suite with $484$ clean artifacts, $1266$ controlled injected artifacts, and $6292$ natural LLM-generated artifacts. Against a matched single-agent baseline, OptArgus produces fewer false alarms on clean artifacts, more accurate top-ranked localization on controlled single-error cases, and stronger detection on natural model outputs. Together, these contributions turn optimization-modeling hallucination detection into a concrete empirical problem and suggest that modular, taxonomy-grounded auditing is a practical route to more reliable optimization modeling.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Together, these contributions turn optimization-modeling hallucination detection into a concrete empirical problem and suggest that modular, taxonomy-grounded auditing is a practical route to more…

WHY NOW

LLM Auditing moved forward this cycle; last verified May 2026. Public score 7.0/10. Production flags indicate code availability.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainA multi-agent system to detect and audit hallucinations in LLM-generated optimization models by checking structural consistency.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A multi-agent system to detect and audit hallucinations in LLM-generated optimization models by checking structural consistency.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A multi-agent system to detect and audit hallucinations in LLM-generated optimization models by checking structural consistency.

Segment

LLM Auditing

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "fd85a114-dac0-4f43-855b-109f0567358a", "arxiv_id": "2605.11738", "canonical_route": "/paper/optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling", "endpoints": { "paper_pack": "/api/v1/paper/optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling/paper-pack", "build_passport": "/api/v1/paper/optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "OptArgus: A Multi-Agent System to Detect Hallucinations in LLM-based Optimization Modeling", "normalized_query": "2605.11738", "route": "/paper/optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling", "paper_ref": "optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling#webpage", "url": "https://sciencetostartup.com/paper/optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling", "name": "OptArgus: A Multi-Agent System to Detect Hallucinations in LLM-based Optimization Modeling", "description": "A multi-agent system to detect and audit hallucinations in LLM-generated optimization models by checking structural consistency.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling#scholarlyArticle", "headline": "OptArgus: A Multi-Agent System to Detect Hallucinations in LLM-based Optimization Modeling", "description": "A multi-agent system to detect and audit hallucinations in LLM-generated optimization models by checking structural consistency.", "url": "https://sciencetostartup.com/paper/optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling", "sameAs": "https://arxiv.org/abs/2605.11738", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.11738" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-12T08:19:14.000Z", "author": [ { "@type": "Person", "name": "Zhong Li" }, { "@type": "Person", "name": "Zihan Guo" }, { "@type": "Person", "name": "Xiaohan Lu" }, { "@type": "Person", "name": "Juntao Wang" }, { "@type": "Person", "name": "Jie Song" }, { "@type": "Person", "name": "Chao Shen" }, { "@type": "Person", "name": "Jiageng Wu" }, { "@type": "Person", "name": "Mingyang Sun" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Auditing" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Auditing", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "OptArgus: A Multi-Agent System to Detect Hallucinations in L", "item": "https://sciencetostartup.com/paper/optargus-a-multi-agent-system-to-detect-hallucinations-in-llm-based-optimization-modeling" } ] } ] }

Competitive landscape

A multi-agent system to detect and audit hallucinations in LLM-generated optimization models by checking structural consistency.

Segment

LLM Auditing

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

OptArgus: A Multi-Agent System to Detect Hallucinations in LLM-based Optimization Modeling

OptArgus: A Multi-Agent System to Detect Hallucinations in LLM-based Optimization Modeling

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline