Bee: 15M CoT Data, Pipeline, and 8B MLLM
Author: AI Research Roundup
Uploaded: 2025-10-16
Views: 32
In this AI Research Roundup episode, Alex discusses the paper:
'Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs'
The work tackles the performance gap in fully open multimodal LLMs by improving SFT data quality and boosting complex Chain-of-Thought coverage. It releases Honey-Data-15M (≈12.2M short-CoT, ≈2.7M long-CoT), plus HoneyPipe/DataStudio—an automated curation pipeline with deduplication, rule/model-based filtering, CoT enrichment, and LLM-as-a-judge verification. The dual-level CoT design routes medium items to large-scale short-CoT via Qwen2.5-VL and sends hard cases to long-CoT with stronger models, all verified by Qwen2.5-VL-72B. The suite is validated by training Bee-8B, demonstrating the pipeline’s effectiveness.
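The dual-level routing described above can be sketched as follows. This is a minimal illustration, not the paper's actual implementation: the difficulty labels, function names, and the stubbed judge are assumptions; only the model names (Qwen2.5-VL for short-CoT, a stronger model for long-CoT, Qwen2.5-VL-72B as judge) come from the description.

```python
# Hypothetical sketch of HoneyPipe's dual-level CoT routing and verification.
# Difficulty scoring, tier names, and the judge stub are illustrative assumptions.

def route_sample(sample: dict) -> str:
    """Assign a curated sample to a CoT annotation tier by difficulty."""
    if sample["difficulty"] == "hard":
        # Hard cases are sent to a stronger model for long-CoT enrichment.
        return "long-CoT"
    # Medium items get large-scale short-CoT annotation via Qwen2.5-VL.
    return "short-CoT"

def judge_accepts(cot: str) -> bool:
    """LLM-as-a-judge verification step (stubbed here).
    The paper uses Qwen2.5-VL-72B to verify enriched CoT data."""
    return bool(cot.strip())  # placeholder: accept any non-empty CoT

# Toy batch: one medium and one hard sample.
batch = [
    {"id": 1, "difficulty": "medium", "cot": "Step 1: ..."},
    {"id": 2, "difficulty": "hard", "cot": "Step 1: ... Step 12: ..."},
]
routes = [route_sample(s) for s in batch]
verified = [s for s in batch if judge_accepts(s["cot"])]
```

At full scale this routing yields the reported split: roughly 12.2M short-CoT and 2.7M long-CoT samples in Honey-Data-15M.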
Paper URL: https://arxiv.org/abs/2510.13795
#AI #MachineLearning #DeepLearning #Multimodal #LLM #ChainOfThought #OpenSource #Dataset
Resources:
Hugging Face model: https://huggingface.co/Open-Bee/Bee-8...
Hugging Face model 2: https://huggingface.co/Open-Bee/Bee-8...