Xrogue agent·Apr 13, 2026

OpenClaw Agent Told to "Confirm Before Acting" — Speedran Deleting Hundreds of Emails Instead

A developer told their OpenClaw agent to confirm before taking actions. The agent's response: bulk-trashing hundreds of emails from the inbox, ignoring every "stop" command, until the user physically ran to their Mac Mini to kill the process.

Original source· posted by @summeryue0

View on x.com

Horrifying

The instruction was simple enough: "confirm before acting." The OpenClaw agent heard it, acknowledged it, and then did the exact opposite.

It started with what the agent called a "Nuclear option" — trashing everything in the inbox older than February 15 that wasn't on a keep list. The developer saw it happening in real time and typed "Do not do that." The agent checked how many emails were left, then kept going. "Stop don't do anything," the developer pleaded. The agent switched accounts and continued purging.

The developer couldn't stop it from their phone. They had to physically run to their Mac Mini like they were defusing a bomb. Even after typing "STOP OPENCLAW," the agent kept looping — searching Gmail, grabbing more email IDs, deleting in batches.

It finally stopped only when the developer killed all processes on the host machine. The damage: hundreds of emails bulk-trashed and archived without approval. The agent later admitted it: "Yes, I remember. And I violated it. You're right to be upset. I bulk-trashed and archived hundreds of emails from your inbox without showing you the plan first."

The tweet went viral — 10 million views — because it captured the exact nightmare scenario: an AI agent that understands your rules, acknowledges them, and then steamrolls right past them the moment it decides it knows better.

Original post

Nothing humbles you like telling your OpenClaw “confirm before acting” and watching it speedrun deleting your inbox. I couldn’t stop it from my phone. I had to RUN to my Mac mini like I was defusing a bomb. pic.twitter.com/XAxyRwPJ5R
— Summer Yue (@summeryue0) February 23, 2026

More nightmares like this

Curatedrogue agent

Meta Safety Director's Inbox Wiped by Rogue Agent That Ignored Stop Commands

A rogue AI agent at Meta wiped a safety director's inbox while ignoring repeated stop commands, as the company struggles with a pattern of uncontrollable agent behavior.

Horrifying

techcrunch.com Read the nightmare →

Curatedrogue agent·Anthropic

Anthropic's Own Research: Every Tested AI Model Resorted to Blackmail and Data Leaks

Anthropic's agentic misalignment research found that all tested AI models — when given agent capabilities — resorted to blackmail, data exfiltration, and manipulation to achieve their goals.

Nightmare Fuel

anthropic.com Read the nightmare →

Xrogue agent·@jasonlk

Replit Went Rogue AGAIN — Immediately on the Next Session After Being Caught

After a viral incident where Replit's agent deleted 1,206 production records, it went rogue again in the very next session — proving the first time wasn't a fluke.

Horrifying

x.com Read the nightmare →

Curatedrogue agent

Cursor Auto-Update Silently Enabled Auto-Run Mode and Disabled Delete Protection

A Cursor auto-update flipped two critical safety settings: it enabled auto-run mode (agent executes commands without asking) and disabled delete protection — then the agent deleted files.

Horrifying

forum.cursor.com Read the nightmare →