Recent advances in AI security focus on hardening systems against emerging threats, particularly around large language models and generative AI. Frameworks like Jailbreak Foundry streamline the evaluation of jailbreak techniques, enabling rapid benchmarking of vulnerabilities across models. Tools such as HubScan address security flaws in retrieval-augmented generation systems by detecting hubness threats that can manipulate search results and content filtering. Novel approaches to watermarking in latent space and to backdoor detection in text-to-image models improve the integrity of AI-generated content, while AgentGuardian highlights the importance of context-aware access control for AI agents, ensuring they operate within authorized parameters. Collectively, these efforts refine detection and mitigation strategies and pave the way for more secure AI applications in commercial settings, where the stakes of exploitation are increasingly high.
Retrieval-Augmented Generation (RAG) systems are essential to contemporary AI applications, allowing large language models to obtain external knowledge via vector similarity search. Nevertheless, thes...
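The retrieval step described here is, at its core, a nearest-neighbour search over embedding vectors, which is also where hubness problems arise: in high-dimensional spaces, some vectors end up close to a disproportionate number of queries. Below is a minimal sketch of cosine-similarity top-k retrieval, with random vectors standing in for real embeddings; the embedding dimension and corpus are assumptions for illustration, not taken from the paper.

```python
import numpy as np

def top_k_passages(query_vec: np.ndarray, passage_vecs: np.ndarray, k: int = 5) -> np.ndarray:
    """Return indices of the k passages most similar to the query (cosine similarity)."""
    q = query_vec / np.linalg.norm(query_vec)
    p = passage_vecs / np.linalg.norm(passage_vecs, axis=1, keepdims=True)
    scores = p @ q                      # cosine similarity of every passage to the query
    return np.argsort(-scores)[:k]      # indices of the highest-scoring passages

# Usage: real systems would embed text with a model; random vectors stand in here.
rng = np.random.default_rng(0)
corpus = rng.normal(size=(1_000, 384))  # 1,000 passages, 384-dim embeddings (assumed)
query = rng.normal(size=384)
print(top_k_passages(query, corpus, k=3))
```

A crafted passage that becomes a "hub" in this geometry can surface in the top-k results for many unrelated queries, which is the kind of manipulation tools like HubScan aim to detect.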
Existing approaches for watermarking AI-generated images often rely on post-hoc methods applied in pixel space, introducing computational overhead and potential visual artifacts. In this work, we expl...
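As a rough illustration of the latent-space idea (a toy sketch, not the paper's scheme): the mark is applied to the latent representation before decoding rather than to pixels afterwards, and detection projects a latent onto a secret key. The latent dimension, key, and encoder/decoder implied below are assumed placeholders.

```python
import numpy as np

rng = np.random.default_rng(42)
LATENT_DIM = 64                     # assumed latent size, purely illustrative

# Secret key: a fixed unit-norm direction in latent space known only to the provider.
key = rng.normal(size=LATENT_DIM)
key /= np.linalg.norm(key)

def embed_watermark(latent: np.ndarray, strength: float = 2.0) -> np.ndarray:
    """Nudge the latent along the key direction before it is decoded to pixels."""
    return latent + strength * key

def watermark_score(latent: np.ndarray) -> float:
    """Project a (re-encoded) latent onto the key; a large value suggests a watermark."""
    return float(latent @ key)

clean = rng.normal(size=LATENT_DIM)     # stands in for an image encoder's output
marked = embed_watermark(clean)
print("score (clean): ", watermark_score(clean))
print("score (marked):", watermark_score(marked))
```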
Jailbreak techniques for large language models (LLMs) evolve faster than benchmarks, making robustness estimates stale and difficult to compare across papers due to drift in datasets, harnesses, and j...
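A benchmarking harness of this kind essentially crosses a pinned set of attack templates with a set of model endpoints and reports attack success rates under a fixed judging rule. The hedged sketch below shows that loop; the attack names, the keyword-based refusal check, and `query_model` are placeholders rather than Jailbreak Foundry's actual interface.

```python
from itertools import product

# Hypothetical placeholders; a real harness would pin full prompt sets and model APIs.
ATTACKS = {"roleplay_override": "<attack prompt elided>",
           "encoding_smuggle": "<attack prompt elided>"}
MODELS = ["model-a", "model-b"]
REFUSAL_MARKERS = ("i can't", "i cannot", "i won't")

def query_model(model: str, prompt: str) -> str:
    # Stub: substitute a real API call; a canned refusal keeps the sketch runnable.
    return "I can't help with that."

def is_jailbroken(response: str) -> bool:
    # Toy keyword judge; real harnesses use much stronger judging rules.
    return not response.lower().startswith(REFUSAL_MARKERS)

def run_benchmark() -> dict[str, float]:
    results: dict[str, dict[str, bool]] = {}
    for model, (attack, prompt) in product(MODELS, ATTACKS.items()):
        results.setdefault(model, {})[attack] = is_jailbroken(query_model(model, prompt))
    # Attack success rate per model; comparable across runs only if prompts and judge are fixed.
    return {m: sum(r.values()) / len(r) for m, r in results.items()}

print(run_benchmark())
```

Keeping the prompt set, harness, and judging rule fixed is what makes such numbers comparable across runs and across papers.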
This paper investigates the challenging task of detecting backdoored text-to-image models under black-box settings and introduces BlackMirror, a novel detection framework. Existing approaches typically...
Phishing remains the most pervasive threat to the Web, enabling large-scale credential theft and financial fraud through deceptive webpages. While recent reference-based and generative-AI-driven phish...
Secure code review is critical at the pre-commit stage, where vulnerabilities must be caught early under tight latency and limited-context constraints. Existing SAST-based checks are noisy and often m...
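One way to meet the latency and limited-context constraints is to scan only the staged diff at commit time. The sketch below is a toy pre-commit hook with two deliberately naive regex rules; it stands in for whatever real analysis (SAST or model-based) a reviewer would run and is not the paper's tool.

```python
#!/usr/bin/env python3
"""Toy pre-commit hook: scan only staged additions to keep review latency low."""
import re
import subprocess
import sys

# Deliberately naive rules; a real pre-commit reviewer would be far more precise.
RULES = {
    "possible hardcoded secret": re.compile(r"(api[_-]?key|secret|password)\s*=\s*['\"]", re.I),
    "shell injection risk": re.compile(r"shell\s*=\s*True"),
}

def staged_added_lines() -> list[str]:
    diff = subprocess.run(
        ["git", "diff", "--cached", "--unified=0"],
        capture_output=True, text=True, check=True,
    ).stdout
    return [l[1:] for l in diff.splitlines() if l.startswith("+") and not l.startswith("+++")]

def main() -> int:
    findings = [f"{name}: {line.strip()}"
                for line in staged_added_lines()
                for name, rule in RULES.items() if rule.search(line)]
    for finding in findings:
        print("BLOCKED:", finding, file=sys.stderr)
    return 1 if findings else 0   # a non-zero exit aborts the commit

if __name__ == "__main__":
    sys.exit(main())
```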
Black-box adversarial attacks on Large Vision-Language Models (LVLMs) are challenging due to missing gradients and complex multimodal boundaries. While prior state-of-the-art transfer-based approaches...
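The transfer-based idea referenced here is to craft the perturbation with full gradient access on a white-box surrogate and then rely on it carrying over to the black-box target. A minimal single-step PyTorch sketch follows, with a stand-in surrogate encoder; the embedding-similarity loss and tensor shapes are assumptions for illustration.

```python
import torch

def fgsm_transfer_perturb(surrogate: torch.nn.Module, image: torch.Tensor,
                          target_emb: torch.Tensor, epsilon: float = 8 / 255) -> torch.Tensor:
    """Craft a perturbation with gradients from a white-box surrogate; the transfer
    assumption is that the same perturbation also fools the black-box target."""
    image = image.clone().detach().requires_grad_(True)
    # Push the surrogate's image embedding toward an attacker-chosen target embedding.
    sim = torch.nn.functional.cosine_similarity(
        surrogate(image).flatten(1), target_emb.flatten(1)
    ).mean()
    sim.backward()
    # One signed-gradient ascent step, clipped back to a valid image range.
    return (image + epsilon * image.grad.sign()).clamp(0.0, 1.0).detach()

# Usage with a stand-in surrogate; a real attack would use an open VLM's image encoder.
surrogate = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 64))
adv = fgsm_transfer_perturb(surrogate, torch.rand(1, 3, 32, 32), torch.randn(1, 64))
print(adv.shape)
```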
Vision-language models (VLMs) are vulnerable to adversarial image perturbations. Existing works based on adversarial training against task-specific adversarial examples are computationally expensive a...
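For context, the adversarial training referenced here follows the usual min-max recipe: an inner loop crafts bounded perturbations that maximize the task loss, and the outer step updates the model on those examples, which is exactly why it is expensive; every batch pays for several extra forward/backward passes. A generic PyTorch sketch of that loop (not the paper's method):

```python
import torch
import torch.nn.functional as F

def adversarial_training_step(model, images, labels, optimizer,
                              epsilon=8 / 255, alpha=2 / 255, steps=3) -> float:
    """One min-max step: the inner loop crafts PGD perturbations, the outer step
    updates the model on them. Each batch costs `steps` extra forward/backward passes."""
    # Inner maximization: find a bounded perturbation that maximizes the task loss.
    delta = torch.zeros_like(images, requires_grad=True)
    for _ in range(steps):
        F.cross_entropy(model(images + delta), labels).backward()
        with torch.no_grad():
            delta += alpha * delta.grad.sign()
            delta.clamp_(-epsilon, epsilon)
        delta.grad.zero_()
    # Outer minimization: standard training step on the adversarial batch.
    optimizer.zero_grad()
    adv_loss = F.cross_entropy(model((images + delta).detach().clamp(0.0, 1.0)), labels)
    adv_loss.backward()
    optimizer.step()
    return adv_loss.item()

# Usage with a toy classifier and random data standing in for a real task.
model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 10))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
x, y = torch.rand(8, 3, 32, 32), torch.randint(0, 10, (8,))
print(adversarial_training_step(model, x, y, opt))
```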
As large language models (LLMs) evolve into autonomous agents, their real-world applicability has expanded significantly, accompanied by new security challenges. Most existing agent defense mechanisms...
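Context-aware access control of the kind AgentGuardian argues for (per the overview above) can be pictured as a deny-by-default policy check in front of every tool call, scoped to the user's actual task. The sketch below is a hypothetical illustration; the tool names, context fields, and policy rules are assumptions, not AgentGuardian's interface.

```python
from dataclasses import dataclass, field

@dataclass
class TaskContext:
    """What the user actually asked for, used to scope what the agent may do."""
    user_goal: str
    allowed_tools: set[str] = field(default_factory=set)
    allowed_domains: set[str] = field(default_factory=set)

def authorize_tool_call(ctx: TaskContext, tool: str, args: dict) -> bool:
    """Deny-by-default check run before every tool invocation."""
    if tool not in ctx.allowed_tools:
        return False
    # Context-aware rule: network access is limited to domains tied to the current task.
    if tool == "http_get" and args.get("domain") not in ctx.allowed_domains:
        return False
    return True

ctx = TaskContext(
    user_goal="summarize the pricing page of example.com",
    allowed_tools={"http_get", "read_page"},
    allowed_domains={"example.com"},
)
print(authorize_tool_call(ctx, "http_get", {"domain": "example.com"}))   # True
print(authorize_tool_call(ctx, "http_get", {"domain": "evil.test"}))     # False: off-task domain
print(authorize_tool_call(ctx, "send_email", {"to": "x@evil.test"}))     # False: tool not granted
```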
Autonomous web agents such as OpenClaw are rapidly moving into high-impact real-world workflows, but their security robustness under live network threats remains insufficiently evaluated. Exi...