Spaces:
Running
Running
Commit
·
49a3cce
1
Parent(s):
19f0320
fix link
Browse files
src/md.py
CHANGED
@@ -101,6 +101,6 @@ current_time = datetime.now(pacific_tz).strftime("%H:%M %Z, %d %b %Y")
|
|
101 |
|
102 |
TOP_TEXT = f"""# RewardBench: Evaluating Reward Models
|
103 |
### Evaluating the capabilities, safety, and pitfalls of reward models
|
104 |
-
[Code](https://github.com/allenai/reward-bench) | [Eval. Dataset](https://huggingface.co/datasets/allenai/reward-bench-v2-v0) | [Prior Test Sets](https://huggingface.co/datasets/allenai/pref-test-sets) | [Results](https://huggingface.co/datasets/allenai/reward-bench-results
|
105 |
|
106 |
⚠️ Many of the top models were trained on unintentionally contaminated, AI-generated data, for more information, see this [gist](https://gist.github.com/natolambert/1aed306000c13e0e8c5bc17c1a5dd300)."""
|
|
|
101 |
|
102 |
TOP_TEXT = f"""# RewardBench: Evaluating Reward Models
|
103 |
### Evaluating the capabilities, safety, and pitfalls of reward models
|
104 |
+
[Code](https://github.com/allenai/reward-bench) | [Eval. Dataset](https://huggingface.co/datasets/allenai/reward-bench-v2-v0) | [Prior Test Sets](https://huggingface.co/datasets/allenai/pref-test-sets) | [Results](https://huggingface.co/datasets/allenai/reward-bench-v2-results) | [Paper](https://arxiv.org/abs/2403.13787) | Total models: {{}} | * Unverified models | ⚠️ Dataset Contamination | Last restart (PST): {current_time}
|
105 |
|
106 |
⚠️ Many of the top models were trained on unintentionally contaminated, AI-generated data, for more information, see this [gist](https://gist.github.com/natolambert/1aed306000c13e0e8c5bc17c1a5dd300)."""
|