Activation Space Interventions Can Be Transferred Between Large Language Models • Paper • 2503.04429 • Published Mar 6, 2025
TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research • Paper • 2503.12730 • Published Mar 17, 2025
Can Large Language Models Infer Causation from Correlation? • Paper • 2306.05836 • Published Jun 9, 2023
OPT-R: Exploring the Role of Explanations in Finetuning and Prompting for Reasoning Skills of Large Language Models • Paper • 2305.12001 • Published May 19, 2023