Commonsense Reasoning in the Wild (Xiang Ren)

Автор: HiTZ zentroa

Загружено: 2022-10-07

Просмотров: 267

Описание:

Current NLP systems impress us by achieving close-to-human performance on benchmarks of answering commonsense questions or writing interesting stories. However, most of the progress is evaluated using static, closed-ended datasets created for individual tasks. To deploy commonsense reasoning services in the wild, we look to develop and evaluate systems that can generate answers in an open-ended way, perform robust logical reasoning, and generalize across diverse task formats, domains, and datasets. In this talk I will share our effort on introducing new formulations of commonsense reasoning challenges and novel evaluation protocols, towards broadening the scope in approaching machine common sense. We hope that such a shift of evaluation paradigm would encourage more research on externalizing the model reasoning process and improving model robustness and cross-task generalization.

Commonsense Reasoning in the Wild (Xiang Ren)

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

Incorporating Commonsense Reasoning into NLP Models (Vered Shwartz)

Incorporating Commonsense Reasoning into NLP Models (Vered Shwartz)

Safer Generative ConvAI - Pascale Fung (The Hong Kong University of Science and Technology)

Safer Generative ConvAI - Pascale Fung (The Hong Kong University of Science and Technology)

Turning IT Frustrations to Freedom: A Case Study with RIA Workspace

Turning IT Frustrations to Freedom: A Case Study with RIA Workspace

Multilingual LLM Evaluation in Practical Settings - Sebastian Ruder (Meta)

Multilingual LLM Evaluation in Practical Settings - Sebastian Ruder (Meta)

Prompting is *not* all you need! Or why Multi-LLM Collaboration Matters-Mirella Lapata (Edin)

Prompting is *not* all you need! Or why Multi-LLM Collaboration Matters-Mirella Lapata (Edin)

LEXam: Сравнительный анализ навыков юридического мышления на 340 экзаменах по праву.

LEXam: Сравнительный анализ навыков юридического мышления на 340 экзаменах по праву.

2022-2023

Erich Hartmann. Jak as wszech czasów trafił w ręce Sowietów?

Erich Hartmann. Jak as wszech czasów trafił w ręce Sowietów?

Is AI a Game Changer or a Threat in Insurance? - EP 406 - Bernhard Kotanko and Julien Condamines

Is AI a Game Changer or a Threat in Insurance? - EP 406 - Bernhard Kotanko and Julien Condamines

[REFAI Seminar 12/11/25] From Sparse Pattern to Smart Acceleration: ML Methods for Future of Compute

[REFAI Seminar 12/11/25] From Sparse Pattern to Smart Acceleration: ML Methods for Future of Compute

Music & AI (Fall 2025)

Music & AI (Fall 2025)

Meaning making with artificial interlocutors and risks of language technology-Emily M. Bender (UW)

Meaning making with artificial interlocutors and risks of language technology-Emily M. Bender (UW)

Arman Cohan - Evaluating and Understanding LLMs: From Scientific Reasoning to Alignment as Judges

Arman Cohan - Evaluating and Understanding LLMs: From Scientific Reasoning to Alignment as Judges

Speech neuroprostheses based on intracranial EEG - Christian Herff (Maastricht University)

Speech neuroprostheses based on intracranial EEG - Christian Herff (Maastricht University)

LATXA hizkuntza-eredu eta txatbota

LATXA hizkuntza-eredu eta txatbota

JUST IN: Canada's Rail Routes Are Wiping Out U.S. Grain Export Leverage!

JUST IN: Canada's Rail Routes Are Wiping Out U.S. Grain Export Leverage!

Stealthy Fingerprinting Users with Personalized Blocking Rules - Elhaj Sheda Hard

Stealthy Fingerprinting Users with Personalized Blocking Rules - Elhaj Sheda Hard

The Mímir Project: Impact of copyrighted materials in LLMs - Javier de la RosaJavier de la Rosa

The Mímir Project: Impact of copyrighted materials in LLMs - Javier de la RosaJavier de la Rosa

Large Reasoning Models Connect Session - 10th December 2025

Large Reasoning Models Connect Session - 10th December 2025

Podaj Paczkę 🎁 - Pełne odcinki 📺 | Seria 3 💙 | Blue - Oficjalny Polski Kanał

Podaj Paczkę 🎁 - Pełne odcinki 📺 | Seria 3 💙 | Blue - Oficjalny Polski Kanał