Anthropic Head of Pretraining on Scaling Laws, Compute, and the Future of AI

Автор: Y Combinator

Загружено: 2025-09-30

Просмотров: 27045

Описание:

Ever wonder what it actually takes to train a frontier AI model?

Ankit Gupta, YC General Partner, sits down with Nick Joseph, Anthropic's Head of Pre-training, to explore the engineering challenges behind training Claude—from managing thousands of GPUs and debugging cursed bugs to balancing compute between pre-training and RL. We cover scaling laws, data strategies, team composition, and why the hardest problems in AI are often infrastructure problems, not ML problems.

Apply to Y Combinator: https://www.ycombinator.com/apply
Work at a startup: https://www.ycombinator.com/jobs

Chapters:
00:00 – Introduction
01:05 – From Vicarious to OpenAI to Anthropic
06:40 – What pretraining is
11:20 – Why next-word prediction won out
16:05 – Scaling laws and the feedback loop of compute → models → revenue
21:50 – Building Anthropic’s early infrastructure
27:35 – Efficiency hacks and debugging at scale
33:10 – Generalists vs. specialists on the pretraining team
38:45 – Challenges of training across thousands of GPUs
44:15 – Working with new chips: GPUs vs. TPUs
49:00 – Pretraining vs. post-training (RLHF and reasoning models)
54:25 – The future of data quality and availability
59:10 – Where pretraining goes next
1:03:00 – Closing reflections

Anthropic Head of Pretraining on Scaling Laws, Compute, and the Future of AI

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

Anthropic Co-founder: Building Claude Code, Lessons From GPT-3 & LLM System Design

Anthropic Co-founder: Building Claude Code, Lessons From GPT-3 & LLM System Design

Alexandr Wang: Building Scale AI, Transforming Work With Agents & Competing With China

Alexandr Wang: Building Scale AI, Transforming Work With Agents & Competing With China

AI без хайпа: как всё работает на самом деле? Александр Машрабов и первый казахстанский единорог

AI без хайпа: как всё работает на самом деле? Александр Машрабов и первый казахстанский единорог

Scaling and the Road to Human-Level AI | Anthropic Co-founder Jared Kaplan

Scaling and the Road to Human-Level AI | Anthropic Co-founder Jared Kaplan

Вертикальные ИИ-агенты могут быть в 10 раз крупнее SaaS

Вертикальные ИИ-агенты могут быть в 10 раз крупнее SaaS

Interpretability: Understanding how AI models think

Interpretability: Understanding how AI models think

Inside Google DeepMind: AGI, Robotics, & World Models Explained - Demis Hassabis

Inside Google DeepMind: AGI, Robotics, & World Models Explained - Demis Hassabis

Andrej Karpathy: Software Is Changing (Again)

Andrej Karpathy: Software Is Changing (Again)

The Truth About The AI Bubble

The Truth About The AI Bubble

Современные подсказки для агентов ИИ

Современные подсказки для агентов ИИ

Deep Dive into LLMs like ChatGPT

Deep Dive into LLMs like ChatGPT

The FDE Playbook for AI Startups with Bob McGrew

The FDE Playbook for AI Startups with Bob McGrew

Yann LeCun | Self-Supervised Learning, JEPA, World Models, and the future of AI

Yann LeCun | Self-Supervised Learning, JEPA, World Models, and the future of AI

Satya Nadella – How Microsoft thinks about AGI

Satya Nadella – How Microsoft thinks about AGI

913: LLM Pre-Training and Post-Training 101 — with Julien Launay

913: LLM Pre-Training and Post-Training 101 — with Julien Launay

СЕО Майкрософт AI: Главный навык эпохи ИИ, который защитит твою карьеру | Мустафа Сулейман

СЕО Майкрософт AI: Главный навык эпохи ИИ, который защитит твою карьеру | Мустафа Сулейман

How OpenAI Shapes Its Research And What's Next - EP 46 Mark Chen

How OpenAI Shapes Its Research And What's Next - EP 46 Mark Chen

François Chollet: How We Get To AGI

François Chollet: How We Get To AGI

От идеи до выхода на 650 миллионов долларов: уроки создания стартапов в сфере ИИ

От идеи до выхода на 650 миллионов долларов: уроки создания стартапов в сфере ИИ

Codex and the future of coding with AI — the OpenAI Podcast Ep. 6

Codex and the future of coding with AI — the OpenAI Podcast Ep. 6