- Популярные видео
- Авто
- Видео-блоги
- ДТП, аварии
- Для маленьких
- Еда, напитки
- Животные
- Закон и право
- Знаменитости
- Игры
- Искусство
- Комедии
- Красота, мода
- Кулинария, рецепты
- Люди
- Мото
- Музыка
- Мультфильмы
- Наука, технологии
- Новости
- Образование
- Политика
- Праздники
- Приколы
- Природа
- Происшествия
- Путешествия
- Развлечения
- Ржач
- Семья
- Сериалы
- Спорт
- Стиль жизни
- ТВ передачи
- Танцы
- Технологии
- Товары
- Ужасы
- Фильмы
- Шоу-бизнес
- Юмор
The Hidden Backdoor That Permanently Breaks AI Safety
Ever wondered how safe "aligned" AI models really are? In this video, we dive into a fascinating paper titled "Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections." While most people think jailbreaking or fine-tuning are the only ways to break an LLM's safety guards, this research reveals a much sneakier risk. Attackers can actually hide "backdoors" inside the models during training. On the outside, the AI looks perfectly safe and passes every security check, but when a specific hidden trigger phrase is used, the safety walls crumble entirely. Even worse, standard fixes can't easily wash these backdoors out. Check out the video to see exactly how this stealthy threat works and what it means for the future of AI security!
References:
Cao, Y., Cao, B., & Chen, J. (2024). Stealthy and persistent unalignment on large language models via backdoor injections. arXiv. https://doi.org/10.48550/arXiv.2312.00027
#AISecurity #LLM #MachineLearning #ArtificialIntelligence #TechDeepDive #CyberSecurity #AIBackdoor #DataPoisoning
Видео The Hidden Backdoor That Permanently Breaks AI Safety канала truverack
References:
Cao, Y., Cao, B., & Chen, J. (2024). Stealthy and persistent unalignment on large language models via backdoor injections. arXiv. https://doi.org/10.48550/arXiv.2312.00027
#AISecurity #LLM #MachineLearning #ArtificialIntelligence #TechDeepDive #CyberSecurity #AIBackdoor #DataPoisoning
Видео The Hidden Backdoor That Permanently Breaks AI Safety канала truverack
Комментарии отсутствуют
Информация о видео
4 июня 2026 г. 0:02:26
00:01:55
Другие видео канала




















