Mghao
commited on
Commit
·
6ed4edd
1
Parent(s):
bda7b7d
update README.md
Browse files
README.md
CHANGED
@@ -10,7 +10,7 @@ pipeline_tag: text-classification
|
|
10 |
# INF Outcome Reward Model
|
11 |
## Introduction
|
12 |
|
13 |
-
[**INF-ORM-Llama3.1-70B**](https://huggingface.co/Skywork/Skywork-Reward-Gemma-2-27B-v0.2) is the outcome reward model roughly built on the [Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) architecture and trained with the dataset [INF-ORM-Preference-Magnitude-80K]().
|
14 |
|
15 |
**Note: Train Details are coming soon!**
|
16 |
|
|
|
10 |
# INF Outcome Reward Model
|
11 |
## Introduction
|
12 |
|
13 |
+
[**INF-ORM-Llama3.1-70B**](https://huggingface.co/Skywork/Skywork-Reward-Gemma-2-27B-v0.2) is the outcome reward model roughly built on the [Llama-3.1-70B-Instruct](https://huggingface.co/meta-llama/Llama-3.1-70B-Instruct) architecture and trained with the dataset [INF-ORM-Preference-Magnitude-80K](https://huggingface.co/datasets/infly/INF-ORM-Preference-Magnitude-80K).
|
14 |
|
15 |
**Note: Train Details are coming soon!**
|
16 |
|