Sasha Rush
I give technical talks and courses on LLMs and Deep Learning.
Professor at Cornell.
AI World Models (Keyon Vafa)
Размышления в дикой природе — Вэньтин Чжао
Linear Attention and Beyond (Interactive Tutorial with Songlin Yang)
Compute-Constrained Data Selection (Junjie Oscar Yin)
How DeepSeek Changes the LLM Story
Стоит ли мне стать постдоком? - Нилуфар Мирешгалла
Speculations on Test-Time Scaling (o1)
Long-Context LLM Extension
Hands on Human-AI Coding
The Mamba in the Llama: Distilling and Accelerating Hybrid Models
Как написать хорошую исследовательскую работу.
Sewon Min - Rethinking Data Use in Large Language Models
Street Fighting Transformers
Simple Diffusion Language Models
Hao Zhang - Chatbot Arena (UCSD / LMSys)
Hanna Hajishirzi (AI2) - OLMo: Findings of Training an Open LM
MambaByte: Token-Free Language Modeling
Luca Soldaini - Curating Pretrain Data (AI2 / Dolma)
Do we need Attention? A Mamba Primer
Swabha Swayamdipta: Towards (Closed-Source) LLM Accountability via Logit Signatures (USC)
Ying Sheng - Bridging human and LLM systems
Tatsu Hashimoto - Lessons from the Alpaca Project (Stanford)
Louis Castricato - RLAIF, User Autonomy, and Controllability (Eleuther / Synthlabs)
Eugene Cheah - From idea to LLM (RWKV / Recursal)
Daphne Ippolito (CMU / Google) - No One-Size Fits All Pre-Training Data
Ludwig Schmidt - Open source AI for Multimodality
Leshem Chosen - Wiki-models through Natural Feedback
Irina Rish (Mila) - Continual Learning of Foundation Models
Niklas Muennighoff - From GPU poor to poor GPU rich
Graham Neubig (CMU) - Can we make building with open-source AI as simple as prompting ChatGPT?