Goodfire
Our mission is to advance humanity's understanding of AI by examining the inner workings of advanced AI models, a field known as AI interpretability. As an applied research lab, we bridge the gap between theoretical science and practical applications of interpretability to build safer and more reliable AI models.
In-Context Learning & "Model Systems" Interpretability (Stanford lecture 3) - Ekdeep Singh Lubana
Computational Motifs (Stanford lecture 2) - Jack Merullo
Causal Mechanistic Interpretability (Stanford lecture 1) - Atticus Geiger
Paint with Ember
Goodfire Ember Quickstart
Goodfire Ember Auto Steer Demo
Editing Llama to be conscious - Goodfire's research preview sneak peek
The power of feature steering - Goodfire's research preview sneak peek
Introducing Goodfire's research preview