ARXIV:2605.02202 · AI SECURITY · SUBMITTED 05 MAY · 20:31 UTC · FRESHNESS STALE

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

CBV: Clean-label Backdoor Attacks on Vision Language Models via Diffusion Models

Ji Guo · Xiaolong Qin · Cencen Liu · Jielei Wang · Jierun Chen · Wenbo Jiang · arXiv

A clean-label backdoor attack on vision-language models using diffusion models to generate natural poisoned examples with triggered image features and multimodal guidance.

Blocked on Code›Score3.0Evidence unverified

Opportunity summary

Pain A clean-label backdoor attack on vision-language models using diffusion models to generate natural poisoned examples with triggered image features and multimodal guidance.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

Open Build Read PDF Signal Canvas Track

PROBLEM

A clean-label backdoor attack on vision-language models using diffusion models to generate natural poisoned examples with triggered image features and multimodal guidance. However, as their applications become increasingly widespread, recent studies have revealed that…

METHOD

Full abstract

Vision-Language Models (VLMs) have achieved remarkable success in tasks such as image captioning and visual question answering (VQA). However, as their applications become increasingly widespread, recent studies have revealed that VLMs are vulnerable to backdoor attacks. Existing backdoor attacks on VLMs primarily rely on data poisoning by adding visual triggers and modifying text labels, where the induced image-text mismatch makes poisoned samples easy to detect. To address this limitation, we propose the Clean-Label Backdoor Attack on VLMs via Diffusion Models (CBV), which leverages diffusion models to generate natural poisoned examples via score matching. Specifically, CBV modifies the score during the reverse generation process of the diffusion model to guide the generation of poisoned samples that contain triggered image features. To further enhance the effectiveness of the attack, we incorporate the textual information of the triggered images as multimodal guidance during generation. Moreover, to enhance stealthiness, we introduce a GradCAM-guided Mask (GM) that restricts modifications to only the most semantically important regions, rather than the entire image. We evaluate our method on MSCOCO and VQA v2 with four representative VLMs, achieving over 80% ASR while preserving normal functionality.

RESULT

ScienceToStartup currently rates this 3.0/10 on the public viability pass. We evaluate our method on MSCOCO and VQA v2 with four representative VLMs, achieving over 80% ASR while preserving normal functionality.

WHY NOW

AI Security moved forward this cycle; last verified May 2026. Public score 3.0/10.

Continue into Read for claims, analysis, references, and neighboring papers.

Opportunity summary

Score3.0

PainA clean-label backdoor attack on vision-language models using diffusion models to generate natural poisoned examples with triggered image features and multimodal guidance.

Evidence0 refs | 3 sources | 50% coverage

Blockerno shell-level blocker reported

Analysis summary

A clean-label backdoor attack on vision-language models using diffusion models to generate natural poisoned examples with triggered image features and multimodal guidance.

VerifiedSource: PDF linkedVerifiedPaperPack: citation fields availablePartialProof: unverified proof status

Competitive landscape

A clean-label backdoor attack on vision-language models using diffusion models to generate natural poisoned examples with triggered image features and multimodal guidance.

Segment

AI Security

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "contract_version": "paper-r2", "paper_id": "80a6f8c6-15dd-4817-9df9-ac17861791ce", "arxiv_id": "2605.02202", "canonical_route": "/paper/cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models", "active_tab": "synced from current hash by the drawer client", "selected_artifact": "cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models", "endpoints": { "paper_pack": "/api/v1/paper/cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models/paper-pack", "build_passport": "/api/v1/paper/cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models/build-passport", "mcp_resource": "sciencetostartup://surfaces/paper-workspace" } }

{ "surface": "paper", "mode": "paper", "query": "CBV: Clean-label Backdoor Attacks on Vision Language Models via Diffusion Models", "normalized_query": "2605.02202", "route": "/paper/cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models", "paper_ref": "cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models", "topic_slug": null, "benchmark_ref": null, "dataset_ref": null }

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models#webpage", "url": "https://sciencetostartup.com/paper/cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models", "name": "CBV: Clean-label Backdoor Attacks on Vision Language Models via Diffusion Models", "description": "A clean-label backdoor attack on vision-language models using diffusion models to generate natural poisoned examples with triggered image features and multimodal guidance.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models#scholarlyArticle", "headline": "CBV: Clean-label Backdoor Attacks on Vision Language Models via Diffusion Models", "description": "A clean-label backdoor attack on vision-language models using diffusion models to generate natural poisoned examples with triggered image features and multimodal guidance.", "url": "https://sciencetostartup.com/paper/cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models", "sameAs": "https://arxiv.org/abs/2605.02202", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2605.02202" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-05-04T04:02:23.000Z", "author": [ { "@type": "Person", "name": "Ji Guo" }, { "@type": "Person", "name": "Xiaolong Qin" }, { "@type": "Person", "name": "Cencen Liu" }, { "@type": "Person", "name": "Jielei Wang" }, { "@type": "Person", "name": "Jierun Chen" }, { "@type": "Person", "name": "Wenbo Jiang" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 3 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "AI Security" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "AI Security", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "CBV: Clean-label Backdoor Attacks on Vision Language Models ", "item": "https://sciencetostartup.com/paper/cbv-clean-label-backdoor-attacks-on-vision-language-models-via-diffusion-models" } ] } ] }

Competitive landscape

A clean-label backdoor attack on vision-language models using diffusion models to generate natural poisoned examples with triggered image features and multimodal guidance.

Segment

AI Security

Adoption evidence

No public code link in the paper record yet

Commercial read

3.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

CBV: Clean-label Backdoor Attacks on Vision Language Models via Diffusion Models

CBV: Clean-label Backdoor Attacks on Vision Language Models via Diffusion Models

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline

Claim map

Constellation map

Competitive landscape

Buzz

PDF

REFERENCES

Related Papers

Related Resources

Subscribe to the weekly brief

Build artifacts

Brief

Experiment plan

Validation checklist

Scientific founder

Translational engineer

Domain operator

GTM lead

Regulatory/clinical advisor

Timeline