Whisper Leak

A new attack that makes it possible to determine the topic of an LLM query from encrypted traffic

4 December 2025

Breaking Agent Backbones

How LLM selection affects agent security

2 December 2025

LOTL Attacks Using Local LLMs

How future devices with built-in LLMs will become a security problem, because attackers will be able to live off the LLM (LOLLM)

30 November 2025

Architecting secure enterprise AI agents with MCP

A guide to designing secure enterprise AI agents using MCP from IBM, with verification from Anthropic

25 November 2025

Defending MLLMs from Implicit Jailbreak Attacks

A new class of attacks where text and image look safe separately, but their combination carries malicious meaning

22 November 2025

Pruning-Activated Attack

Model pruning can be used by an attacker

17 November 2025

External Data Extraction Attacks against RAG

The paper studies a new class of attacks against RAG-type systems

14 November 2025

Fine-Tuning Jailbreaks

The paper discusses vulnerabilities in fine-tuning systems for large language models under conditions close to real-world operation

10 November 2025

Airgeddon loves WiFi

Nobody likes wires; everyone loves Wi-Fi

7 November 2025

Tool Tweak

An attack on tool selection in agentic systems

6 November 2025