AI & ML interests

Principled evaluation of mechanistic interpretability methods.

Recent Activity

amueller  updated a Space 3 days ago
mib-bench/leaderboard
amueller  updated a model 3 days ago
mib-bench/mib-circuits-example
amueller  updated a Space 5 days ago
mib-bench/leaderboard
View all activity