ARXIV:2603.22041 · GENERATIVE AI SAFETY · SUBMITTED 02 APR · 02:30 UTC · FRESHNESS STALE

Source: PDF linked (Verified) | PaperPack: citation fields available (Verified) | Proof: unverified proof status (Partial)

DTVI: Dual-Stage Textual and Visual Intervention for Safe Text-to-Image Generation

Binhong Tan · Zhaoxin Wang · Handing Wang · arXiv

A dual-stage defense framework for safe text-to-image generation that intervenes at both textual and visual stages to capture and attenuate unsafe content.

Ship in 2-4 weeks | Score 7.0 | Evidence unverified

Opportunity summary

Pain A dual-stage defense framework for safe text-to-image generation that intervenes at both textual and visual stages to capture and attenuate unsafe content.

Evidence 0 refs | 0 sources | 17% coverage

Blocker Evidence unverified

PROBLEM

A dual-stage defense framework for safe text-to-image generation that intervenes at both textual and visual stages to capture and attenuate unsafe content. Existing inference-time defense methods typically perform category-agnostic token-level intervention in the text…

METHOD

Full abstract

Text-to-Image (T2I) diffusion models have demonstrated strong generation ability, but their potential to generate unsafe content raises significant safety concerns. Existing inference-time defense methods typically perform category-agnostic token-level intervention in the text embedding space, which fails to capture malicious semantics distributed across the full token sequence and remains vulnerable to adversarial prompts. In this paper, we propose DTVI, a dual-stage inference-time defense framework for safe T2I generation. Unlike existing methods that intervene on specific token embeddings, our method introduces category-aware sequence-level intervention on the full prompt embedding to better capture distributed malicious semantics, and further attenuates the remaining unsafe influences during the visual generation stage. Experimental results on real-world unsafe prompts, adversarial prompts, and multiple harmful categories show that our method achieves effective and robust defense, obtaining an average Defense Success Rate (DSR) of 94.43% across sexual-category benchmarks and 88.56% across seven unsafe categories, while preserving reasonable generation quality on benign prompts.
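The abstract contrasts token-level intervention with category-aware intervention on the full prompt embedding. The toy sketch below illustrates only that general idea and is not the authors' implementation: the function `intervene`, the per-category unit directions, the threshold, and the projection-removal step are all assumptions made for illustration, and the visual-stage attenuation is not modeled at all.

```python
from math import sqrt

def flatten(seq):
    """Concatenate per-token vectors into one sequence-level vector."""
    return [x for token in seq for x in token]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def normalize(v):
    n = sqrt(dot(v, v))
    return [x / n for x in v]

def intervene(prompt_emb, category_dirs, threshold=0.5, strength=1.0):
    """Hypothetical sequence-level edit (an assumption, not the paper's rule).

    prompt_emb: list of T token vectors, each of length D.
    category_dirs: dict mapping a category name to a unit direction of
        length T*D in the flattened embedding space (illustrative only).
    Returns (matched category or None, edited embedding of shape T x D).
    """
    dim = len(prompt_emb[0])
    flat = flatten(prompt_emb)
    # Score each category on the whole sequence, not on single tokens.
    scores = {name: dot(flat, d) for name, d in category_dirs.items()}
    name = max(scores, key=scores.get)
    if scores[name] <= threshold:
        # Nothing exceeds the threshold: return an unchanged copy.
        return None, [list(t) for t in prompt_emb]
    d = category_dirs[name]
    # Subtract the component along the matched category direction.
    edited = [x - strength * scores[name] * y for x, y in zip(flat, d)]
    return name, [edited[i:i + dim] for i in range(0, len(edited), dim)]

# Toy data: 2 tokens x 2 dims, with made-up category directions.
dirs = {
    "sexual": normalize([1.0, 0.0, 1.0, 0.0]),
    "violence": normalize([0.0, 1.0, 0.0, 1.0]),
}
# Each token alone contributes about 0.42 along "sexual" (below 0.5),
# but the full sequence scores about 0.85, so only the sequence-level
# score crosses the threshold.
prompt = [[0.6, 0.1], [0.6, 0.1]]
category, safe = intervene(prompt, dirs)
residual = dot(flatten(safe), dirs["sexual"])

benign = [[0.0, 0.1], [0.0, 0.1]]
benign_category, benign_out = intervene(benign, dirs)
```

In this toy setup the unsafe component is spread evenly over both tokens, which is the situation the abstract says token-level methods miss. How DTVI actually derives category information and edits the embedding is described in the paper, not here.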

RESULT

ScienceToStartup currently rates this 7.0/10 on the public viability pass. Experimental results on real-world unsafe prompts, adversarial prompts, and multiple harmful categories show that our method achieves effective and robust defense while preserving reasonable…

WHY NOW

Generative AI Safety moved forward this cycle; last verified April 2026. Public score 7.0/10. Production flags indicate code availability.

Opportunity summary

Score 7.0

Pain A dual-stage defense framework for safe text-to-image generation that intervenes at both textual and visual stages to capture and attenuate unsafe content.

Evidence 0 refs | 0 sources | 17% coverage

Blocker no shell-level blocker reported

Analysis summary

A dual-stage defense framework for safe text-to-image generation that intervenes at both textual and visual stages to capture and attenuate unsafe content.

Competitive landscape

A dual-stage defense framework for safe text-to-image generation that intervenes at both textual and visual stages to capture and attenuate unsafe content.

Segment

Generative AI Safety

Adoption evidence

No public code link in the paper record yet

Commercial read

7.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/dtvi-dual-stage-textual-and-visual-intervention-for-safe-text-to-image-generation#webpage", "url": "https://sciencetostartup.com/paper/dtvi-dual-stage-textual-and-visual-intervention-for-safe-text-to-image-generation", "name": "DTVI: Dual-Stage Textual and Visual Intervention for Safe Text-to-Image Generation", "description": "A dual-stage defense framework for safe text-to-image generation that intervenes at both textual and visual stages to capture and attenuate unsafe content.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/dtvi-dual-stage-textual-and-visual-intervention-for-safe-text-to-image-generation#scholarlyArticle", "headline": "DTVI: Dual-Stage Textual and Visual Intervention for Safe Text-to-Image Generation", "description": "A dual-stage defense framework for safe text-to-image generation that intervenes at both textual and visual stages to capture and attenuate unsafe content.", "url": "https://sciencetostartup.com/paper/dtvi-dual-stage-textual-and-visual-intervention-for-safe-text-to-image-generation", "sameAs": "https://arxiv.org/abs/2603.22041", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2603.22041" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-03-23T14:41:11.000Z", "author": [ { "@type": "Person", "name": "Binhong Tan" }, { "@type": "Person", "name": "Zhaoxin Wang" }, { "@type": "Person", "name": "Handing Wang" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 7 }, { "@type": "PropertyValue", "propertyID": "researchDomain", "value": "Generative AI Safety" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ 
{ "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "Generative AI Safety", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "DTVI: Dual-Stage Textual and Visual Intervention for Safe Te", "item": "https://sciencetostartup.com/paper/dtvi-dual-stage-textual-and-visual-intervention-for-safe-text-to-image-generation" } ] } ] }
