Популярное

Музыка Кино и Анимация Автомобили Животные Спорт Путешествия Игры Юмор

Интересные видео

2025 Сериалы Трейлеры Новости Как сделать Видеоуроки Diy своими руками

Топ запросов

смотреть а4 schoolboy runaway турецкий сериал смотреть мультфильмы эдисон
dTub
Скачать

Building and evaluating AI Agents — Sayash Kapoor, AI Snake Oil

Автор: AI Engineer

Загружено: 2025-04-17

Просмотров: 200824

Описание:

Is 2025 the year of AI agents? Will reasoning models allow agents to solve challenging open problems? From software engineering to web task automation, it has been claimed that agents will solve challenging open problems. Unfortunately, current agents suffer from many shortcomings that reduce their utility in real-world tasks — look no further than Rabbit R1 and the Humane Pin. In this talk, we will explore how current agents fall far short of their claimed performance in the real world and understand best practices for improving agent evaluation. Learn how to avoid known pitfalls and build AI agents that actually matter.

Recorded live at the Agent Engineering Session Day from the AI Engineer Summit 2025 in New York. Learn more at https://ai.engineer and purchase tickets to our next event, the AI Engineer World's Fair, in SF June 3 - 5 here: https://ti.to/software-3/ai-engineer-...

Sayash Kapoor is a Senior Fellow at Mozilla, a Laurance S. Rockefeller Graduate Prize Fellow in the University Center for Human Values, and a computer science Ph.D. candidate at Princeton University's Center for Information Technology Policy. He is a coauthor of AI Snake Oil, a book that provides a critical analysis of artificial intelligence, separating the hype from the true advances. He has written for outlets like WIRED and The Wall Street Journal, and his work has been featured in The New York Times, The Atlantic, Washington Post, Bloomberg, and many others. Kapoor has been recognized with various awards, including TIME’s inaugural list of the 100 most influential people in AI.

Building and evaluating AI Agents — Sayash Kapoor, AI Snake Oil

Поделиться в:

Доступные форматы для скачивания:

Скачать видео mp4

  • Информация по загрузке:

Скачать аудио mp3

Похожие видео

Обучение с подкреплением для агентов — Уилл Браун, исследователь машинного обучения в Morgan Stanley

Обучение с подкреплением для агентов — Уилл Браун, исследователь машинного обучения в Morgan Stanley

Trust, but Verify: Knowledge Agents for Finance Workflows - Mike Conover

Trust, but Verify: Knowledge Agents for Finance Workflows - Mike Conover

Как мы создаем эффективных агентов: Барри Чжан, Anthropic

Как мы создаем эффективных агентов: Барри Чжан, Anthropic

5 тревожных знаков, что ИИ — это пузырь! Как не попасть в ловушку?

5 тревожных знаков, что ИИ — это пузырь! Как не попасть в ловушку?

Stop Using RAG as Memory

Stop Using RAG as Memory

Gemini 3 just crushed everything

Gemini 3 just crushed everything

Training Agentic Reasoners — Will Brown, Prime Intellect

Training Agentic Reasoners — Will Brown, Prime Intellect

Современные подсказки для агентов ИИ

Современные подсказки для агентов ИИ

Andrew Ng Explores The Rise Of AI Agents And Agentic Reasoning | BUILD 2024 Keynote

Andrew Ng Explores The Rise Of AI Agents And Agentic Reasoning | BUILD 2024 Keynote

GraphRAG: союз графов знаний и RAG: Эмиль Эйфрем

GraphRAG: союз графов знаний и RAG: Эмиль Эйфрем

#define AI Engineer - Greg Brockman, OpenAI (ft. Jensen Huang)

#define AI Engineer - Greg Brockman, OpenAI (ft. Jensen Huang)

Andrej Karpathy: Software Is Changing (Again)

Andrej Karpathy: Software Is Changing (Again)

RAG vs. CAG: Solving Knowledge Gaps in AI Models

RAG vs. CAG: Solving Knowledge Gaps in AI Models

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

What do tech pioneers think about the AI revolution? - The Engineers, BBC World Service

Новый код — Шон Гроув, OpenAI

Новый код — Шон Гроув, OpenAI

Убийца Chrome? ChatGPT Atlas – тест и вердикт

Убийца Chrome? ChatGPT Atlas – тест и вердикт

Агенты RAG в производстве: 10 уроков, которые мы усвоили — Дауве Киела, создатель RAG

Агенты RAG в производстве: 10 уроков, которые мы усвоили — Дауве Киела, создатель RAG

Architecting Agent Memory: Principles, Patterns, and Best Practices — Richmond Alake, MongoDB

Architecting Agent Memory: Principles, Patterns, and Best Practices — Richmond Alake, MongoDB

Voice Agent Engineering — Nik Caryotakis, SuperDial

Voice Agent Engineering — Nik Caryotakis, SuperDial

AI Snake Oil: What Artificial Intelligence Can Do, What It Can’t, and How to Tell the Difference

AI Snake Oil: What Artificial Intelligence Can Do, What It Can’t, and How to Tell the Difference

© 2025 dtub. Все права защищены.



  • Контакты
  • О нас
  • Политика конфиденциальности



Контакты для правообладателей: [email protected]