How to optimize Amazon Bedrock | The Keys to AWS Optimization | S13 E3

Автор: The Keys to AWS Optimization

Загружено: 2025-05-02

Просмотров: 365

Описание:

The episode focuses on optimizing Amazon Bedrock, AWS’s managed AI service, with insights from FinOps and generative AI experts. The hosts clarify key AI terminology, distinguishing between traditional machine learning (ML), which relies on statistical modeling, and generative AI (GenAI), which can create new content and reason beyond its training data. They explain foundational models (FMs), large language models (LLMs), and tokens-the billing unit for LLM usage. Bedrock simplifies AI deployment by managing infrastructure and billing by tokens, offering flexibility through on-demand, provisioned, and batch pricing models. The discussion covers strategies for selecting the right model based on use case, cost, and performance, emphasizing the importance of understanding Bedrock’s native and marketplace model billing in AWS Cost Explorer. Optimization techniques include model distillation (creating smaller, faster models for specific tasks), fine-tuning (improving model performance for particular domains), latency optimization (paying a premium for faster responses), and prompt caching (reducing costs for repeated queries). The episode also introduces retrieval-augmented generation (RAG), which enhances model outputs with external data via Bedrock knowledge bases, and highlights the need for careful cost management of related AWS resources. Finally, the guests share tools and best practices for evaluating model performance, ensuring responsible AI use, and maximizing ROI from GenAI investments.

https://docs.aws.amazon.com/bedrock/l...
https://aws.amazon.com/blogs/machine-...
https://docs.aws.amazon.com/sagemaker...
https://aws.amazon.com/blogs/aws/redu...

/ david-tepper

How to optimize Amazon Bedrock | The Keys to AWS Optimization | S13 E3

Доступные форматы для скачивания:

Скачать видео mp4

Информация по загрузке:

Скачать аудио mp3

Похожие видео

How to cost optimization GenAI | The Keys to AWS Optimization | S13 E4

How to cost optimization GenAI | The Keys to AWS Optimization | S13 E4

RAG vs Fine-Tuning vs Prompt Engineering: Optimizing AI Models

RAG vs Fine-Tuning vs Prompt Engineering: Optimizing AI Models

The Man Behind Google's AI Machine | Demis Hassabis Interview

The Man Behind Google's AI Machine | Demis Hassabis Interview

AWS re:Invent 2024 - Build scalable RAG applications using Amazon Bedrock Knowledge Bases (AIM305)

AWS re:Invent 2024 - Build scalable RAG applications using Amazon Bedrock Knowledge Bases (AIM305)

Почему RAG терпит неудачу — как CLaRa устраняет свой главный недостаток

Почему RAG терпит неудачу — как CLaRa устраняет свой главный недостаток

Fine-Tuning LLM Made Easy

Fine-Tuning LLM Made Easy

Большинство разработчиков не понимают, как работают контекстные окна.

Большинство разработчиков не понимают, как работают контекстные окна.

Вайб-кодинг в Cursor AI: полный гайд + реальный пример проекта (подходы, техники, трюки)

Вайб-кодинг в Cursor AI: полный гайд + реальный пример проекта (подходы, техники, трюки)

EASIEST Way to Train LLM Train w/ unsloth (2x faster with 70% less GPU memory required)

EASIEST Way to Train LLM Train w/ unsloth (2x faster with 70% less GPU memory required)

How to use the AWS MCPs to optimize your code | The Keys to AWS Optimization | S14 E9

How to use the AWS MCPs to optimize your code | The Keys to AWS Optimization | S14 E9

A beginners guide to Amazon Bedrock, Amazon Bedrock Knowledge Bases, Guardrails and Security

A beginners guide to Amazon Bedrock, Amazon Bedrock Knowledge Bases, Guardrails and Security

Building Cloud Cost Governance That Lasts | The Keys to AWS Optimization | S14 E10

Building Cloud Cost Governance That Lasts | The Keys to AWS Optimization | S14 E10

ИИ - ЭТО ИЛЛЮЗИЯ ИНТЕЛЛЕКТА. Но что он такое и почему совершил революцию?

ИИ - ЭТО ИЛЛЮЗИЯ ИНТЕЛЛЕКТА. Но что он такое и почему совершил революцию?

Big Data Rules For AI: Essential Data Management Principles

Big Data Rules For AI: Essential Data Management Principles

Spin up custom RAG agent on AWS Bedrock in just 20 minutes!

Spin up custom RAG agent on AWS Bedrock in just 20 minutes!

The Do's and Don'ts of Dashboarding | The Keys to AWS Optimization | S13 E1

The Do's and Don'ts of Dashboarding | The Keys to AWS Optimization | S13 E1

БЕЛЫЕ СПИСКИ: какой VPN-протокол справится? Сравниваю все

БЕЛЫЕ СПИСКИ: какой VPN-протокол справится? Сравниваю все

Модель контекстного протокола (MCP), четко объясненная (почему это важно)

Модель контекстного протокола (MCP), четко объясненная (почему это важно)

Claude Code: полный гайд по AI-кодингу (хаки, техники и секреты)

Claude Code: полный гайд по AI-кодингу (хаки, техники и секреты)

Fine-Tuning your Foundation Model in Amazon Bedrock | Amazon Web Services

Fine-Tuning your Foundation Model in Amazon Bedrock | Amazon Web Services