saumyamalik commited on
Commit
19f0320
·
1 Parent(s): 5883014

links go to v2

Browse files
Files changed (1) hide show
  1. src/md.py +1 -1
src/md.py CHANGED
@@ -101,6 +101,6 @@ current_time = datetime.now(pacific_tz).strftime("%H:%M %Z, %d %b %Y")
101
 
102
  TOP_TEXT = f"""# RewardBench: Evaluating Reward Models
103
  ### Evaluating the capabilities, safety, and pitfalls of reward models
104
- [Code](https://github.com/allenai/reward-bench) | [Eval. Dataset](https://huggingface.co/datasets/allenai/reward-bench) | [Prior Test Sets](https://huggingface.co/datasets/allenai/pref-test-sets) | [Results](https://huggingface.co/datasets/allenai/reward-bench-results) | [Paper](https://arxiv.org/abs/2403.13787) | Total models: {{}} | * Unverified models | ⚠️ Dataset Contamination | Last restart (PST): {current_time}
105
 
106
  ⚠️ Many of the top models were trained on unintentionally contaminated, AI-generated data, for more information, see this [gist](https://gist.github.com/natolambert/1aed306000c13e0e8c5bc17c1a5dd300)."""
 
101
 
102
  TOP_TEXT = f"""# RewardBench: Evaluating Reward Models
103
  ### Evaluating the capabilities, safety, and pitfalls of reward models
104
+ [Code](https://github.com/allenai/reward-bench) | [Eval. Dataset](https://huggingface.co/datasets/allenai/reward-bench-v2-v0) | [Prior Test Sets](https://huggingface.co/datasets/allenai/pref-test-sets) | [Results](https://huggingface.co/datasets/allenai/reward-bench-results-v2) | [Paper](https://arxiv.org/abs/2403.13787) | Total models: {{}} | * Unverified models | ⚠️ Dataset Contamination | Last restart (PST): {current_time}
105
 
106
  ⚠️ Many of the top models were trained on unintentionally contaminated, AI-generated data, for more information, see this [gist](https://gist.github.com/natolambert/1aed306000c13e0e8c5bc17c1a5dd300)."""