Issue No. 001 • AI Safety • April 30, 2026 • 8 min read
The Black Box Has a Group Chat
Mechanistic interpretability, AI safety, and my first blog
Mechanistic interpretability asks what actually happens inside a model, and why the answer matters once these systems stop being toys and start becoming infrastructure.