ARXIV:2602.01740 · VIDEO AI INFERENCE · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

MACD: Model-Aware Contrastive Decoding via Counterfactual Data

arXiv

Develop targeted counterfactual data-based inference technique to reduce hallucinations in Video-LLMs.

Blocked on Code›Score7.0Evidence unverified

Opportunity summary

Pain Develop targeted counterfactual data-based inference technique to reduce hallucinations in Video-LLMs.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

Develop targeted counterfactual data-based inference technique to reduce hallucinations in Video-LLMs. Existing decoding methods, such as contrastive decoding (CD), rely on random perturbations to construct contrastive data for mitigating hallucination patterns.

METHOD

Full abstract

Video language models (Video-LLMs) are prone to hallucinations, often generating plausible but ungrounded content when visual evidence is weak, ambiguous, or biased. Existing decoding methods, such as contrastive decoding (CD), rely on random perturbations to construct contrastive data for mitigating hallucination patterns. However, such a way is hard to control the visual cues that drive hallucination or well align with model weaknesses. We propose Model-aware Counterfactual Data based Contrastive Decoding (MACD), a new inference strategy that combines model-guided counterfactual construction with decoding. Our approach uses the Video-LLM's own feedback to identify object regions most responsible for hallucination, generating targeted counterfactual inputs at the object level rather than arbitrary frame or temporal modifications. These model-aware counterfactual data is then integrated into CD to enforce evidence-grounded token selection during decoding. Experiments on EventHallusion, MVBench, Perception-test and Video-MME show that MACD consistently reduces hallucination while maintaining or improving task accuracy across diverse Video-LLMs, including Qwen and InternVL families. The method is especially effective in challenging scenarios involving small, occluded, or co-occurring objects. Our code and data will be publicly released.

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Experiments on EventHallusion, MVBench, Perception-test and Video-MME show that MACD consistently reduces hallucination while maintaining or improving task accuracy across diverse Video-LLMs, including Qwen…

WHY NOW

Video AI Inference moved forward this cycle; last verified April 2026. Public score 7.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score7.0

PainDevelop targeted counterfactual data-based inference technique to reduce hallucinations in Video-LLMs.

Evidence0 refs | 0 sources | 17% coverage

Blockermissing authors

Analysis summary

Develop targeted counterfactual data-based inference technique to reduce hallucinations in Video-LLMs.

VerifiedSource: PDF linkedPartialPaperPack: 3 of 4 citation fields filledMissingMissing fields: authorsPartialProof: unverified proof status

Competitive landscape

Develop targeted counterfactual data-based inference technique to reduce hallucinations in Video-LLMs.

Segment

Video AI Inference

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "3b379055-2575-4cad-8e02-087021d68e85", "arxiv_id": "2602.01740", "canonical_route": "/paper/macd-model-aware-contrastive-decoding-via-counterfactual-data", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "macd-model-aware-contrastive-decoding-via-counterfactual-data", "endpoints": { "paper_pack": "/api/v1/paper/macd-model-aware-contrastive-decoding-via-counterfactual-data/paper-pack", "build_passport": "/api/v1/paper/macd-model-aware-contrastive-decoding-via-counterfactual-data/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "MACD: Model-Aware Contrastive Decoding via Counterfactual Data", "normalized_query": "2602.01740", "route": "/paper/macd-model-aware-contrastive-decoding-via-counterfactual-data", "paper_ref": "macd-model-aware-contrastive-decoding-via-counterfactual-data", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/macd-model-aware-contrastive-decoding-via-counterfactual-data#webpage", "url": "https://sciencetostartup.com/paper/macd-model-aware-contrastive-decoding-via-counterfactual-data", "name": "MACD: Model-Aware Contrastive Decoding via Counterfactual Data", "description": "Develop targeted counterfactual data-based inference technique to reduce hallucinations in Video-LLMs.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/macd-model-aware-contrastive-decoding-via-counterfactual-data#scholarlyArticle", "headline": "MACD: Model-Aware Contrastive Decoding via Counterfactual Data", "description": "Develop targeted counterfactual data-based inference technique to reduce hallucinations in Video-LLMs.", "url": "https://sciencetostartup.com/paper/macd-model-aware-contrastive-decoding-via-counterfactual-data", "sameAs": "https://arxiv.org/abs/2602.01740", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2602.01740" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-02-02T07:21:02.000Z", "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Video AI Inference" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Video AI Inference", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "MACD: Model-Aware Contrastive Decoding via Counterfactual Da", "item": "https://sciencetostartup.com/paper/macd-model-aware-contrastive-decoding-via-counterfactual-data" } ] } ] }

Competitive landscape

Develop targeted counterfactual data-based inference technique to reduce hallucinations in Video-LLMs.

Segment

Video AI Inference

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

MACD: Model-Aware Contrastive Decoding via Counterfactual Data

MACD: Model-Aware Contrastive Decoding via Counterfactual Data

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline