Safetensors
llama
zkshan2002 commited on
Commit
219c19c
·
verified ·
1 Parent(s): 6cffcae

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -1
README.md CHANGED
@@ -4,4 +4,8 @@ datasets:
4
  base_model:
5
  - unsloth/Llama-3.2-1B-Instruct
6
  ---
7
- Trained with tokenizer of [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co/OpenRLHF/Llama-3-8b-sft-mixture).
 
 
 
 
 
4
  base_model:
5
  - unsloth/Llama-3.2-1B-Instruct
6
  ---
7
+ Base mode: [unsloth/Llama-3.2-1B-Instruct](https://huggingface.co/unsloth/Llama-3.2-1B-Instruct)
8
+
9
+ Tokenizer: [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co/OpenRLHF/Llama-3-8b-sft-mixture)
10
+
11
+ Preference dataset: [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized)