Update README.md
README.md
CHANGED
@@ -4,4 +4,8 @@ datasets:
 base_model:
 - unsloth/Llama-3.2-1B-Instruct
 ---
-
+Base mode: [unsloth/Llama-3.2-1B-Instruct](https://huggingface.co/unsloth/Llama-3.2-1B-Instruct)
+
+Tokenizer: [OpenRLHF/Llama-3-8b-sft-mixture](https://huggingface.co/OpenRLHF/Llama-3-8b-sft-mixture)
+
+Preference dataset: [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized)