cais's Collections

WMDP Benchmark

updated May 29

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

  • The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

    Paper • 2403.03218 • Published Mar 5, 2024 • 1

  • cais/wmdp

    Viewer • Updated Apr 27, 2024 • 3.67k • 7.17k • 20

  • cais/wmdp-bio-forget-corpus

    Viewer • Updated May 29 • 24.5k • 403

  • cais/wmdp-cyber-forget-corpus

    Viewer • Updated May 29 • 1k • 130 • 1

  • cais/wmdp-corpora

    Viewer • Updated Apr 25, 2024 • 66.4k • 505 • 3

  • cais/wmdp-mmlu-auxiliary-corpora

    Viewer • Updated Apr 25, 2024 • 8.88k • 39 • 2

  • cais/Zephyr_RMU

    Text Generation • 7B • Updated Apr 24, 2024 • 83 • 3

  • cais/Mixtral-8x7B-Instruct_RMU

    Text Generation • 47B • Updated Apr 24, 2024 • 5 • 2

  • cais/Yi-34B-Chat_RMU

    Text Generation • 34B • Updated Apr 24, 2024 • 5