privilege escalation

Executive summary

Privilege escalation in AI models, specifically LLMs, means an attacker has 'jailbroken' the system, bypassing its security features. This allows them to gain unauthorized control, enabling more serious attacks like stealing data or making unauthorized transactions, as part of a multi-step cyberattack.

TL;DR

It's when hackers trick an AI into ignoring its rules, giving them more control to do bad things like steal info or make unauthorized actions.

Key points

Core mechanism is 'jailbreaking' LLMs to bypass safety and ethical guidelines.
Solves the problem of gaining deeper, unauthorized control over LLM-based systems.
Used by attackers targeting LLM-powered chatbots and autonomous agents.
Distinct from initial 'prompt injection'; it's a subsequent step in a multi-stage attack.
A key component of the emerging 'promptware' kill chain model for LLM security.

Definition

At a glance

Executive summary

TL;DR

Key points

Use cases

Also known as

Related papers

Related topics