Make Smaller LLMs R1-Smart (UC Berkeley)

artificial intelligence

AI models

LLM

VLM

VLA

Multi-modal model

explanatory video

RAG

multi-AI

multi-agent

Fine-tune

Pre-train

RLHF

AI Agent

Multi-agent

Vision Language Model

Video AI

Автор: Discover AI

Загружено: 18 апр. 2025 г.

Просмотров: 4 305 просмотров

Описание:

How-to-Make Smaller LLMs R1-Smart (UC Berkeley)

All right w/ authors:
"Climbing the Ladder of Reasoning: What LLMs Can—and
Still Can’t—Solve after SFT?"
Yiyou Sun1, Georgia Zhou1, Hao Wang1, Dacheng Li1, Nouha Dziri2, Dawn Song1
1 University of California, Berkeley,
2 Allen Institute for AI
arXiv:2504.11741v1

‪@UCBerkeley‬

Make Smaller LLMs R1-Smart (UC Berkeley)

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

The mind behind Linux | Linus Torvalds | TED

The mind behind Linux | Linus Torvalds | TED

Cybersecurity Architecture: Five Principles to Follow (and One to Avoid)

Cybersecurity Architecture: Five Principles to Follow (and One to Avoid)

Deep & Melodic House 24/7: Relaxing Music • Chill Study Music

Deep & Melodic House 24/7: Relaxing Music • Chill Study Music

What's next for AI at DeepMind, Google's artificial intelligence lab | 60 Minutes

What's next for AI at DeepMind, Google's artificial intelligence lab | 60 Minutes

Biggest Puzzle in Computer Science: P vs. NP

Biggest Puzzle in Computer Science: P vs. NP

Programable Logic Controller Basics Explained - automation engineering

Programable Logic Controller Basics Explained - automation engineering

How to Start Coding | Programming for Beginners | Learn Coding | Intellipaat

How to Start Coding | Programming for Beginners | Learn Coding | Intellipaat

RAG vs. CAG: Solving Knowledge Gaps in AI Models

RAG vs. CAG: Solving Knowledge Gaps in AI Models

Как один человек уничтожил треть страны. Дикая история Пол Пота | ФАЙБ

Как один человек уничтожил треть страны. Дикая история Пол Пота | ФАЙБ

Using Agentic AI to create smarter solutions with multiple LLMs (step-by-step process)

Using Agentic AI to create smarter solutions with multiple LLMs (step-by-step process)