pints-ai/1.5-Pints-16K-v0.1


lemousehunter committed on Aug 12, 2024

Commit 2d40ec4 · verified · 1 Parent(s): 9cdaf9a

Update README.md

Files changed (1)

README.md +1 -1


@@ -195,7 +195,7 @@ Dataset: [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets
 <br><br>
 ## Training Procedure
-Both Pre-Train and Finetuning used [our fork](https://github.com/Pints-AI/1.5-Pints) of the [LitGPT Framework](https://github.com/Lightning-AI/litgpt). For DPO, we used the methods set out in [The Alignment Handbook](https://github.com/huggingface/alignment-handbook/blob/main/scripts/run_dpo.py). More details can be found in our [paper](TOBEADDED).
+Both Pre-Train and Finetuning used [our fork](https://github.com/Pints-AI/1.5-Pints) of the [LitGPT Framework](https://github.com/Lightning-AI/litgpt). For DPO, we used the methods set out in [The Alignment Handbook](https://github.com/huggingface/alignment-handbook/blob/main/scripts/run_dpo.py). More details can be found in our [paper](https://arxiv.org/abs/2408.03506).
 ## Training Hyperparameters
 **Pre-Train**<br>