Why Language Models Hallucinate - Adam Kalai
Author: Institute for Advanced Study
Computer Science/Discrete Mathematics Seminar I
11:00 am | Simonyi Hall 101 and Remote Access
Topic: Why Language Models Hallucinate
Speaker: Adam Kalai
Affiliation: OpenAI
Date: November 24, 2025
Large language models (LLMs) sometimes generate statements that are plausible but factually incorrect—a phenomenon commonly called "hallucination." We argue that these errors are not mysterious failures of architecture or reasoning, but rather predictable consequences of standard training and evaluation incentives.
We show (i) that hallucinations can be viewed as classification errors: when pretrained models cannot reliably distinguish a false statement from a true one, they may produce the false option rather than saying "I don't know"; (ii) that optimizing for benchmark performance encourages guessing rather than abstaining, since most evaluation metrics penalize expressions of uncertainty; and (iii) that a possible mitigation path lies in revising existing benchmarks to reward calibrated abstention, thereby realigning incentives in model development.
Joint work with Santosh Vempala (Georgia Tech) and Ofir Nachum & Edwin Zhang (OpenAI).
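To make the incentive argument in point (ii) concrete, here is a minimal sketch (not from the talk; the scoring scheme, penalty value, and function name are illustrative assumptions) comparing the expected score of guessing versus abstaining when a model is correct with probability p. Under a plain accuracy-style metric, guessing always dominates abstaining; under a metric that penalizes confident wrong answers by c, abstention wins whenever p < c / (1 + c).

```python
# Illustrative sketch only: expected score of guessing vs. abstaining on a
# question where the model is correct with probability p. The penalty scheme
# below is a hypothetical example, not the scoring rule proposed in the talk.

def expected_score(p: float, wrong_penalty: float = 0.0) -> dict:
    """Expected score for a single question: +1 if correct, -wrong_penalty if
    wrong, 0 for abstaining ("I don't know")."""
    guess = p * 1.0 + (1.0 - p) * (-wrong_penalty)
    abstain = 0.0
    return {"guess": guess, "abstain": abstain,
            "best": "guess" if guess > abstain else "abstain"}

if __name__ == "__main__":
    for p in (0.1, 0.3, 0.5, 0.7):
        plain = expected_score(p, wrong_penalty=0.0)  # accuracy-style metric
        penalized = expected_score(p, wrong_penalty=1.0)  # wrong answers cost -1
        print(f"p={p:.1f}  accuracy metric: {plain['best']:7s}  "
              f"penalized metric: {penalized['best']}")
```

With the assumed penalty of 1, the break-even point is p = 1/2: the accuracy-style metric recommends guessing at every p, while the penalized metric recommends abstaining whenever the model's chance of being right is below one half.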