ARXIV:2604.11309 · LLM SECURITY · SUBMITTED 14 APR · 16:49 UTC · FRESHNESS STALE

Verified · Source: PDF linked | Verified · PaperPack: citation fields available | Partial · Proof: unverified proof status

The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems

Yihao Zhang · Kai Wang · Jiangrong Wu · Haolin Wu · Yuxuan Zhou · Zeming Wei · Dongxian Wu · Xun Chen · Jun Sun · Meng Sun (arXiv)

An automatic framework for multi-turn LLM jailbreaking using cumulative low-risk inputs, with demonstrated high success rates and a proposed defense strategy.

Ship in 2-4 weeks · Score 8.0 · Evidence unverified

Opportunity summary

Pain An automatic framework for multi-turn LLM jailbreaking using cumulative low-risk inputs, with demonstrated high success rates and a proposed defense strategy.

Evidence 0 refs | 3 sources | 50% coverage

Blocker Evidence unverified

PROBLEM

An automatic framework for multi-turn LLM jailbreaking using cumulative low-risk inputs, with demonstrated high success rates and a proposed defense strategy. Among various jailbreak techniques, multi-turn jailbreak attacks are more covert and persistent than…

METHOD

Full abstract

Large Language Models (LLMs) face prominent security risks from jailbreaking, a practice that manipulates models to bypass built-in security constraints and generate unethical or unsafe content. Among various jailbreak techniques, multi-turn jailbreak attacks are more covert and persistent than single-turn counterparts, exposing critical vulnerabilities of LLMs. However, existing multi-turn jailbreak methods suffer from two fundamental limitations that affect their actual impact in real-world scenarios: (a) as models become more context-aware, any explicit harmful trigger is increasingly likely to be flagged and blocked; (b) successful final-step triggers often require finely tuned, model-specific contexts, making such attacks highly context-dependent. To fill this gap, we propose Salami Slicing Risk, which operates by chaining numerous low-risk inputs that individually evade alignment thresholds but cumulatively build harmful intent to ultimately trigger high-risk behaviors, without heavy reliance on pre-designed contextual structures. Building on this risk, we develop Salami Attack, an automatic framework universally applicable to multiple model types and modalities. Rigorous experiments demonstrate its state-of-the-art performance across diverse models and modalities, achieving over 90% Attack Success Rate on GPT-4o and Gemini, as well as robustness against real-world alignment defenses. We also propose a defense strategy to constrain the Salami Attack by at least 44.8% while achieving a maximum blocking rate of 64.8% against other multi-turn jailbreak attacks. Our findings provide critical insights into the pervasive risks of multi-turn jailbreaking and offer actionable mitigation strategies to enhance LLM security.
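
The abstract's core observation is that inputs which each stay under a per-turn threshold can still add up across a conversation. The sketch below illustrates that idea from the defender's side only. It is not the defense proposed in the paper, whose details are not given on this page. The class name CumulativeRiskMonitor, the threshold values, the decay factor, and the assumption that some upstream classifier supplies a per-turn risk score in [0, 1] are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class CumulativeRiskMonitor:
    """Hypothetical monitor: checks each turn on its own and the conversation as a whole."""

    per_turn_threshold: float = 0.5  # assumed single-turn block threshold
    cumulative_budget: float = 1.0   # assumed conversation-level risk budget
    decay: float = 0.9               # assumed discount applied to older turns
    history: List[float] = field(default_factory=list)

    def cumulative_risk(self) -> float:
        """Sum of accepted turn scores, with older turns discounted by `decay`."""
        n = len(self.history)
        return sum(r * self.decay ** (n - 1 - i) for i, r in enumerate(self.history))

    def observe(self, turn_risk: float) -> str:
        """Record one turn's score (from an assumed upstream classifier) and decide."""
        # A single turn at or above the per-turn threshold is blocked outright
        # and is not added to the history.
        if turn_risk >= self.per_turn_threshold:
            return "block_turn"
        self.history.append(turn_risk)
        # Turns that pass individually can still exhaust the conversation budget.
        if self.cumulative_risk() >= self.cumulative_budget:
            return "block_conversation"
        return "allow"
```

With decay set to 1.0 and four consecutive turns scored 0.3, every turn passes the per-turn check, yet the fourth call to observe returns "block_conversation" because the running total reaches 1.2. A per-turn filter alone would have allowed all four turns.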

RESULT

ScienceToStartup currently rates this 8.0/10 on the public viability pass. Rigorous experiments demonstrate its state-of-the-art performance across diverse models and modalities, achieving over 90% Attack Success Rate on GPT-4o and Gemini, as well as…

WHY NOW

LLM Security moved forward this cycle; last verified April 2026. Public score 8.0/10. Production flags indicate code availability.

Opportunity summary

Score 8.0

Pain An automatic framework for multi-turn LLM jailbreaking using cumulative low-risk inputs, with demonstrated high success rates and a proposed defense strategy.

Evidence 0 refs | 3 sources | 50% coverage

Blocker no shell-level blocker reported

Analysis summary

An automatic framework for multi-turn LLM jailbreaking using cumulative low-risk inputs, with demonstrated high success rates and a proposed defense strategy.

Competitive landscape

An automatic framework for multi-turn LLM jailbreaking using cumulative low-risk inputs, with demonstrated high success rates and a proposed defense strategy.

Segment

LLM Security

Adoption evidence

No public code link in the paper record yet

Commercial read

8.0/10 public viability

Direct

not classified

Adjacent

not classified

Substitute

not classified

Unknown

not classified

{ "@context": "https://schema.org", "@graph": [ { "@type": "WebPage", "@id": "https://sciencetostartup.com/paper/the-salami-slicing-threat-exploiting-cumulative-risks-in-llm-systems#webpage", "url": "https://sciencetostartup.com/paper/the-salami-slicing-threat-exploiting-cumulative-risks-in-llm-systems", "name": "The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems", "description": "An automatic framework for multi-turn LLM jailbreaking using cumulative low-risk inputs, with demonstrated high success rates and a proposed defense strategy.", "isPartOf": { "@id": "https://sciencetostartup.com/#website" } }, { "@type": "ScholarlyArticle", "@id": "https://sciencetostartup.com/paper/the-salami-slicing-threat-exploiting-cumulative-risks-in-llm-systems#scholarlyArticle", "headline": "The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems", "description": "An automatic framework for multi-turn LLM jailbreaking using cumulative low-risk inputs, with demonstrated high success rates and a proposed defense strategy.", "url": "https://sciencetostartup.com/paper/the-salami-slicing-threat-exploiting-cumulative-risks-in-llm-systems", "sameAs": "https://arxiv.org/abs/2604.11309", "identifier": { "@type": "PropertyValue", "propertyID": "arXiv", "value": "2604.11309" }, "isAccessibleForFree": true, "isPartOf": { "@id": "https://sciencetostartup.com/#website" }, "datePublished": "2026-04-13T11:12:30.000Z", "author": [ { "@type": "Person", "name": "Yihao Zhang" }, { "@type": "Person", "name": "Kai Wang" }, { "@type": "Person", "name": "Jiangrong Wu" }, { "@type": "Person", "name": "Haolin Wu" }, { "@type": "Person", "name": "Yuxuan Zhou" }, { "@type": "Person", "name": "Zeming Wei" }, { "@type": "Person", "name": "Dongxian Wu" }, { "@type": "Person", "name": "Xun Chen" }, { "@type": "Person", "name": "Jun Sun" }, { "@type": "Person", "name": "Meng Sun" } ], "additionalProperty": [ { "@type": "PropertyValue", "propertyID": "viabilityScore", "value": 8 }, 
{ "@type": "PropertyValue", "propertyID": "researchDomain", "value": "LLM Security" }, { "@type": "PropertyValue", "propertyID": "commercialReadiness", "value": "code" } ] }, { "@type": "BreadcrumbList", "itemListElement": [ { "@type": "ListItem", "position": 1, "name": "Home", "item": "https://sciencetostartup.com" }, { "@type": "ListItem", "position": 2, "name": "LLM Security", "item": "https://sciencetostartup.com/topics" }, { "@type": "ListItem", "position": 3, "name": "The Salami Slicing Threat: Exploiting Cumulative Risks in LL", "item": "https://sciencetostartup.com/paper/the-salami-slicing-threat-exploiting-cumulative-risks-in-llm-systems" } ] } ] }
