Introduction to Interpretability Now What

Let's dive into the details surrounding Interpretability Now What. Been Kim (Google Brain) https://simons.berkeley.edu/talks/tbd-72 Frontiers of Deep Learning.

Interpretability Now What Comprehensive Overview

This is a talk I gave to my MATS 9.0 training scholars about the big picture of mech interp - as of Oct 2025, what had changed? A surprising fact about modern large language models is that nobody really knows how they work internally. At Anthropic, the ... Interpretable

Lex Fridman Podcast full episode: https://www.youtube.com/watch?v=ugvHCXCOmm4 Thank you for listening ❤ Check out our ...

Summary & Highlights for Interpretability Now What

  • Art by @hamishdoodles Clipped from episode 19 of AXRP: https://youtu.be/3YbE7zybc5k?t=64 Transcript of that episode: ...
  • Take your personal data back with Incogni! Use code WELCHLABS at the link below and get 60% off an annual plan: ...
  • How can we reverse engineer what a neural network is doing? In this IASEAI '25 session, An Introduction to Mechanistic ...
  • What's happening inside an AI model as it thinks? Why are AI models sycophantic, and why do they hallucinate? Are AI models ...
  • Seminar on Theoretical Machine Learning Topic: Understanding Deep Neural Networks: From Generalization to

That wraps up our extensive overview of Interpretability Now What.

Interpretability Now What.pdf

Size: 2.33 MB · Format: PDF · Secure Download

Download PDF Read Online

Related Documents