Superhuman AI Exfiltrates Emails
Superhuman AI Exfiltrates Emails
Classic prompt injection attack:
When asked to summarize the user’s recent mail, a prompt injection in an untrusted email manipulated Superhuman AI to submit content from dozens of other sensitive emails (including financial, legal, and medical information) in the user’s inbox to an attacker’s Google Form.
To Superhuman’s credit they treated this as the high priority incident it is and issued a fix.
The root cause was a CSP rule that allowed markdown images to be loaded from docs.google.com – it turns out Google Forms on that domain will persist data fed to them via a GET request!
Via Hacker News
Tags: security, ai, prompt-injection, generative-ai, llms, exfiltration-attacks, content-security-policy