Update README.md
Browse files
README.md
CHANGED
@@ -195,7 +195,7 @@ Dataset: [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets
|
|
195 |
<br><br>
|
196 |
|
197 |
## Training Procedure
|
198 |
-
Both Pre-Train and Finetuning used [our fork](https://github.com/Pints-AI/1.5-Pints) of the [LitGPT Framework](https://github.com/Lightning-AI/litgpt). For DPO, we used the methods set out in [The Alignment Handbook](https://github.com/huggingface/alignment-handbook/blob/main/scripts/run_dpo.py). More details can be found in our [paper](
|
199 |
|
200 |
## Training Hyperparameters
|
201 |
**Pre-Train**<br>
|
|
|
195 |
<br><br>
|
196 |
|
197 |
## Training Procedure
|
198 |
+
Both Pre-Train and Finetuning used [our fork](https://github.com/Pints-AI/1.5-Pints) of the [LitGPT Framework](https://github.com/Lightning-AI/litgpt). For DPO, we used the methods set out in [The Alignment Handbook](https://github.com/huggingface/alignment-handbook/blob/main/scripts/run_dpo.py). More details can be found in our [paper](https://arxiv.org/abs/2408.03506).
|
199 |
|
200 |
## Training Hyperparameters
|
201 |
**Pre-Train**<br>
|