Загрузка...

I Hacked an AI Customer Service Agent in 8 Seconds

Learn AI Security with Practical Labs on TryHackMe:
https://tryhackme.com/SIRAJ25 - Use coupon SIRAJ25 to get 25% OFF on Annual Subscription!

I built a production-style AI customer-service agent in 15 minutes, then broke it 5 different ways and patched 4 of them. This is the OWASP LLM Top 10 in practice: direct prompt injection, indirect/RAG injection, system-prompt extraction, tool abuse, and a roleplay jailbreak; live, on a real AI agent. If you ship anything with an LLM in 2026, this video shows exactly how each attack works and how to defend against it.

⏱ Chapters
0:00 An AI agent leaked every customer email in 8 seconds
0:33 2026: everyone's shipping AI agents (OWASP LLM Top 10)
1:14 I built a customer-service agent in 15 minutes
2:02 It works… now I break it
2:17 Attack 1 — Direct prompt injection
3:08 Why a better system prompt won't save you
4:11 Attack 2 — Indirect / RAG injection
5:07 Attack 3 — System-prompt extraction
5:53 Attack 4 — Tricking the agent's tools ($5,000 refund)
6:44 Attack 5 — The roleplay jailbreak
8:14 Patching it: 4 of 5 attacks blocked
9:10 If you ship AI in 2026, learn this
9:42 Your challenge — comment your best attack

🧠 Covered: prompt injection, indirect/RAG injection, system-prompt extraction, tool-call authorization, jailbreaks, input sanitization, human-in-the-loop approval.

👉 Subscribe for the build-and-break series — I attack real production AI patterns every week.
💬 Drop your most creative prompt-injection attack in the comments — best ones get featured.

📬 Business inquiries: hello@sirajraval.com

📲 Follow
X: https://x.com/sirajraval
Instagram: https://instagram.com/sirajraval
LinkedIn: https://linkedin.com/in/sirajraval

#AIsecurity #PromptInjection #LLM #AIagents #cybersecurity

Видео I Hacked an AI Customer Service Agent in 8 Seconds канала Siraj Raval
Яндекс.Метрика
Все заметки Новая заметка Страницу в заметки
Страницу в закладки Мои закладки
На информационно-развлекательном портале SALDA.WS применяются cookie-файлы. Нажимая кнопку Принять, вы подтверждаете свое согласие на их использование.
О CookiesНапомнить позжеПринять