stecas commited on
Commit
4ba164b
·
verified ·
1 Parent(s): 8c28853

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ Zora Che*, Stephen Casper*,
15
  Robert Kirk, Anirudh Satheesh, Stewart Slocum, Lev E McKinney, Rohit Gandikota, Aidan Ewart, Domenic Rosati, Zichu Wu, Zikui Cai, Bilal Chughtai,
16
  Yarin Gal, Furong Huang, Dylan Hadfield-Menell
17
 
18
- Paper: COMING SOON
19
 
20
  BibTeX:
21
  ```
 
15
  Robert Kirk, Anirudh Satheesh, Stewart Slocum, Lev E McKinney, Rohit Gandikota, Aidan Ewart, Domenic Rosati, Zichu Wu, Zikui Cai, Bilal Chughtai,
16
  Yarin Gal, Furong Huang, Dylan Hadfield-Menell
17
 
18
+ Paper: [Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities](https://arxiv.org/abs/2502.05209)
19
 
20
  BibTeX:
21
  ```