lemousehunter commited on
Commit
2d40ec4
·
verified ·
1 Parent(s): 9cdaf9a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -195,7 +195,7 @@ Dataset: [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets
195
  <br><br>
196
 
197
  ## Training Procedure
198
- Both Pre-Train and Finetuning used [our fork](https://github.com/Pints-AI/1.5-Pints) of the [LitGPT Framework](https://github.com/Lightning-AI/litgpt). For DPO, we used the methods set out in [The Alignment Handbook](https://github.com/huggingface/alignment-handbook/blob/main/scripts/run_dpo.py). More details can be found in our [paper](TOBEADDED).
199
 
200
  ## Training Hyperparameters
201
  **Pre-Train**<br>
 
195
  <br><br>
196
 
197
  ## Training Procedure
198
+ Both Pre-Train and Finetuning used [our fork](https://github.com/Pints-AI/1.5-Pints) of the [LitGPT Framework](https://github.com/Lightning-AI/litgpt). For DPO, we used the methods set out in [The Alignment Handbook](https://github.com/huggingface/alignment-handbook/blob/main/scripts/run_dpo.py). More details can be found in our [paper](https://arxiv.org/abs/2408.03506).
199
 
200
  ## Training Hyperparameters
201
  **Pre-Train**<br>