What is LLM Red Teaming? How Generative AI Safety Testing Works
Author: AI Lingua
Uploaded: 2025-04-12
Views: 2850
A quick introduction to Generative AI Red Teaming (LLM Red Teaming)—what it is, how it's done, and why it matters for AI safety, security, and risk management. Learn about common jailbreaking methods, testing strategies, and model vulnerabilities.
🔗 Jailbreaking Taxonomy: https://innodata.com/llm-jailbreaking...
Presented by Karen McNeil, PhD, Director of LLM Practice and Red Teaming at Innodata.
Illustrations by Yevheniia Lisovaya
Music by Benjamin Tissot, Bensound (License code: Y0ZTXBXFHRDIDQ4G)
Innodata provides comprehensive LLM red teaming services. Visit https://innodata.com to learn more.
00:00 Introduction
00:54 Model Safety
01:53 Why is it called Red Teaming?
02:33 LLM Harms
03:34 Jailbreaking
06:04 Multimodal Red Teaming
07:57 Indirect Prompt Injection
08:57 Automated (LLM vs. LLM) Red Teaming
09:53 Why Humans Still Matter
11:30 Who Should Care About Red Teaming?
12:38 Conclusion