Inside the LLM: Visualizing the Embeddings Layer of Mistral-7B and Gemma-2B

Author: Chris Hay

Uploaded: 2024-03-10

Views: 8130

Description:

We look deep inside the AI to see how the embeddings layer of a Large Language Model such as Mistral-7B or Gemma-2B actually works.

You will learn how tokens and embeddings work, and even extract the embeddings layer from Gemma and Mistral and load it into your own simple model, which we will use to visualize the model.
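To make the idea of tokens and an embeddings layer concrete, here is a minimal sketch in plain Python. It is not the code from the video or the repository: the vocabulary, the vector values, and the helper names `tokenize` and `embed` are all invented for illustration, and a whitespace split stands in for the subword tokenizers that Mistral and Gemma actually use.

```python
# Toy illustration only: the vocabulary and vector values below are invented,
# not extracted from Mistral-7B or Gemma-2B.
vocab = {"<unk>": 0, "red": 1, "blue": 2, "python": 3}

# One row per token id. Real models use far larger tables with many more
# dimensions per row; three dimensions keep the example readable.
embedding_table = [
    [0.0, 0.0, 0.0],  # <unk>
    [0.9, 0.1, 0.0],  # red
    [0.8, 0.2, 0.1],  # blue
    [0.0, 0.1, 0.9],  # python
]


def tokenize(text):
    """Hypothetical whitespace tokenizer: maps each word to a token id."""
    return [vocab.get(word, vocab["<unk>"]) for word in text.lower().split()]


def embed(token_ids):
    """The embeddings layer is a row lookup: token id -> vector."""
    return [embedding_table[token_id] for token_id in token_ids]


ids = tokenize("Red python zebra")
vectors = embed(ids)
print(ids)
print(vectors)
```

The point of the sketch is that the first layer does no computation beyond indexing: each token id selects one row of a table, and words missing from the vocabulary fall back to the `<unk>` row.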

You will see how an AI clusters terms together: it groups similar words and builds connections that cover not just similar words but also groups of concepts such as colors, hotel chains, and programming terms.
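One common way to see this kind of clustering is to rank words by the cosine similarity of their vectors. The sketch below assumes that approach; the description does not say which measure the video uses. The `toy_embeddings` values are made up so that colors and programming terms point in different directions, and `nearest` is a hypothetical helper name.

```python
import math

# Invented 3-dimensional vectors, chosen so that colors and programming
# terms form two separate groups. They are not real model weights.
toy_embeddings = {
    "red": [0.9, 0.1, 0.0],
    "blue": [0.8, 0.2, 0.1],
    "green": [0.85, 0.15, 0.05],
    "python": [0.0, 0.1, 0.9],
    "java": [0.1, 0.0, 0.95],
}


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (1.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest(word, k=2):
    """Return the k words whose vectors are most similar to `word`."""
    query = toy_embeddings[word]
    scores = [
        (other, cosine_similarity(query, vector))
        for other, vector in toy_embeddings.items()
        if other != word
    ]
    scores.sort(key=lambda pair: pair[1], reverse=True)
    return [other for other, _ in scores[:k]]


print(nearest("red"))
print(nearest("python", k=1))
```

With these toy vectors the neighbors of "red" are the other colors and the neighbor of "python" is "java", which mirrors the concept grouping described above on a very small scale.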

If you really want to understand how an LLM works, or even build your own LLM, then the first layer of a generative AI model is the best place to start.

GitHub
-----------
https://github.com/chrishayuk/embeddings
