A Primer on the Inner Workings of Transformer-based Language Models Paper • 2405.00208 • Published Apr 30, 2024 • 9
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language Models Paper • 2404.07004 • Published Apr 10, 2024 • 6
Calibrating Reasoning in Language Models with Internal Consistency Paper • 2405.18711 • Published May 29, 2024 • 6
🔍 Interpretability & Analysis of LMs Collection Outstanding research in LM interpretability and evaluation, summarized • 101 items • Updated about 18 hours ago • 97