Low Level Technicals of LLMs: Daniel Han

Author: AI Engineer

Uploaded: 2024-07-31

Views: 51639

Description:

This workshop is split into three one-hour blocks:

How to analyze & fix LLMs - how to find and fix bugs in Gemma, Phi-3, Llama & tokenizers
Finetuning with Unsloth - continued pretraining, reward modelling, QLoRA & more
Deep dive into LLM technicals - hand deriving derivatives, SOTA finetuning tricks

It's recommended that you have Python with PyTorch and Unsloth installed (or use Google Colab / Kaggle online). College-level maths and programming would be helpful.

Recorded live in San Francisco at the AI Engineer World's Fair. See the full schedule of talks at https://www.ai.engineer/worldsfair/20... & join us at the AI Engineer World's Fair in 2025! Get your tickets today at https://ai.engineer/2025

About Daniel
Hey I'm Daniel, the algos guy behind Unsloth. I love making LLM training go fast! We're the guys who fixed 8 of Google's Gemma bugs, a 2048 SWA Phi-3 issue, found tokenization issues and fixed untrained tokens with Llama-3, and I run Unsloth with my brother Michael!

Our open source package makes finetuning of LLMs 2x faster and uses 70% less VRAM with no accuracy degradation. I used to work at NVIDIA making GPU algos go fast and helped NASA engineers process data from a Mars rover faster!

