Safetensors
llama
zkshan2002 commited on
Commit
f7897f4
·
verified ·
1 Parent(s): bc49bd8

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -0
README.md ADDED
@@ -0,0 +1,11 @@
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ datasets:
3
+ - weqweasdas/ultra_train
4
+ base_model:
5
+ - OpenRLHF/Llama-3-8b-sft-mixture
6
+ ---
7
+ Base Model: [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co/OpenRLHF/Llama-3-8b-sft-mixture)
8
+
9
+ Reward model: [RTO-RL/Llama3-8B-RewardModel](https://huggingface.co/RTO-RL/Llama3-8B-RewardModel)
10
+
11
+ Prompt dataset: [weqweasdas/ultra_train](https://huggingface.co/datasets/weqweasdas/ultra_train)