LLM Evals - Part 1: Evaluating Performance
Author: Trelis Research
Uploaded: Premiered Dec 30, 2024
Views: 2,973
➡️ Get access to the ADVANCED-Evals Repo (incl. future additions): https://trelis.com/ADVANCED-evals/
➡️ https://docs.google.com/presentation/...
➡️ Thumbnail made with this tutorial: Fine Tune Flux Diffusion Models with ...
OTHER TRELIS LINKS:
➡️ Explore Developer Tools/Scripts: https://Trelis.com/
➡️ Trelis Newsletter: https://trelis.substack.com
➡️ Collaborate with Trelis: https://Trelis.com/developer-collabor...
➡️ Consulting: https://Trelis.com/
TIMESTAMPS:
00:00 Introduction to LLM Evaluation
03:21 Understanding Evaluation Pipelines
09:56 Building a Demo Application
15:21 Creating Evaluation Datasets
23:52 Practical Evaluation Task / Question Development
27:40 Running and Analyzing Evaluations
30:24 Comparing LLM Model Performance using Evals
34:09 Conclusion and Next Steps
