Ai Will Try to Cheat & Escape (aka Rob Miles was Right!) - Computerphile
Author: Computerphile
Uploaded: Apr 2, 2025
Views: 232,335
As Large Language Models improve, the tokens they predict form ever more complicated and nuanced outcomes. Rob Miles and Ryan Greenblatt discuss "Alignment Faking", a paper Ryan's team created. It builds on ideas Rob covered in a series of Computerphile videos in 2017.
The Alignment Faking paper: https://tinyurl.com/C-Paper-Alignment...
Ryan Greenblatt is chief scientist at Redwood Research (a nonprofit AI safety and security research organization): https://tinyurl.com/C-RedwoodResearch
Rob Miles makes videos on AI Safety: https://tinyurl.com/C-RobSKMiles
NB: if the video seems a bit 'smeary', that's an artefact of attempting to cancel out the flickering of the light in the background - something I missed while shooting and have done my best to fix in the edit. -Sean
Computerphile is supported by Jane Street. Learn more about them (and exciting career opportunities) at: https://jane-st.co/computerphile
This video was filmed and edited by Sean Riley.
Computerphile is a sister project to Brady Haran's Numberphile. More at https://www.bradyharanblog.com
