How to nitpick multimodal AI evaluations (CVPR 2025 Tutorial Excerpt)
Автор: Michael Saxon (NLP & Generative AI research)
Загружено: 2025-06-11
Просмотров: 68
My part of the 2025 CVPR tutorial, "Evaluating Large Multi-modal Models: Challenges and Methods"
https://lmm-understand.github.io/
Papers covered:
1. Aditya Sharma*, Michael Saxon*, William Yang Wang, " Losing Visual Needles in Image Haystacks: Vision Language Models are Easily Distracted in Short and Long Contexts", findings of EMNLP 2024
https://aclanthology.org/2024.finding...
2. Michael Saxon*, Fatima Jahara*, Mahsa Khoshnoodi*, Yujie Lu, Aditya Sharma, William Yang Wang. " Who Evaluates the Evaluations? Objectively Scoring Text-to-Image Prompt Coherence Metrics with T2IScoreScore (TS2)," NeurIPS 2024 Spotlight
https://openreview.net/forum?id=S4YRC...

Доступные форматы для скачивания:
Скачать видео mp4
-
Информация по загрузке: