Why Benchmarks Aren't Enough: Eve Fleisig on Sociolinguistics & AI Evaluation | NeurIPS 2025

Author: SAIL Media

Uploaded: 2025-12-26

Views: 4

Description:

Is Natural Language Processing (NLP) dead, or has it just evolved? At NeurIPS 2025, Jessica Dai sits down with Eve Fleisig from @BerkeleyEECS to discuss the shifting culture of machine learning and why "loving words" still matters in the age of LLMs.

In this interview, Eve breaks down the critical difference between benchmarks (fixed proxies for optimization) and true evaluations (measuring real-world, downstream impacts). She explores how sociolinguistics can help us detect subtle harms like dialect discrimination, where models discriminate based on how you speak rather than just who you are, and explains why we need to move beyond optimizing for a single number.

Timestamps:
0:00 - The NeurIPS experience & where did the "NLP" go?
1:32 - The challenge of evaluation when humans disagree
2:17 - Bringing linguistics back: Syntax vs. Sociolinguistics
3:06 - The hidden harm of Dialect Discrimination
4:45 - The optimization trap: Are our proxies good enough?
5:40 - The crucial distinction: Benchmarks vs. Evaluations
6:50 - The future: Measuring long-term downstream impacts

#NeurIPS2025 #AI #NLP #MachineLearning #Sociolinguistics #EthicalAI #LLM
