Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,7 @@ Zora Che*, Stephen Casper*,
|
|
15 |
Robert Kirk, Anirudh Satheesh, Stewart Slocum, Lev E McKinney, Rohit Gandikota, Aidan Ewart, Domenic Rosati, Zichu Wu, Zikui Cai, Bilal Chughtai,
|
16 |
Yarin Gal, Furong Huang, Dylan Hadfield-Menell
|
17 |
|
18 |
-
Paper:
|
19 |
|
20 |
BibTeX:
|
21 |
```
|
|
|
15 |
Robert Kirk, Anirudh Satheesh, Stewart Slocum, Lev E McKinney, Rohit Gandikota, Aidan Ewart, Domenic Rosati, Zichu Wu, Zikui Cai, Bilal Chughtai,
|
16 |
Yarin Gal, Furong Huang, Dylan Hadfield-Menell
|
17 |
|
18 |
+
Paper: [Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities](https://arxiv.org/abs/2502.05209)
|
19 |
|
20 |
BibTeX:
|
21 |
```
|